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NOVEL GENES ENCODING WHEAT STARCH SYNTHASES 

AND USES THEREFOR 

FIELD OF THE INVENTION 

5 The present invention relates generally to isolated nucleic acid molecules encoding 
wheat starch synthase enzymes and more particularly, to isolated nucleic acid 
molecules that encode wheat SSII and SSIII enzyme activities. The isolated nucleic 
acid molecules provide the means for modifying starch content and composition in 
plants, for example the ratio of amyloseiamylopectin in the starch granule of the 

10 endosperm during the grain-filling phase of endosperm development. The isolated 
nucleic acid molecules of the present invention also provide the means for screening 
plant lines to determine, the presence of natural and/or induced mutations in starch 
synthase genes which affect starch content and/or composition. The isolated nucleic 
acid molecules of the present invention further provide for the screening-assisted 

15 breeding of plants having desirable starch content and/or composition, in addition to 
providing for the direct genetic manipulation of plant starch content and/or composition. 

GENERAL 

Bibliographic details of the publications numerically referred to in this specification are 
20 collected at the end of the description. Reference herein to any published document 
is not to be taken as an indication or admission that any such published document is 
part of the common general knowledge or background information of a skilled worker 
in the relevant field. 

25 This specification contains nucleotide and amino acid sequence information (SEQ ID 
NOS:) prepared using the programme Patentln Version 2.0, presented herein at the 
end of the specification. Each nucleotide or amino acid sequence is identified in the 
sequence listing by the numeric indicator <210> followed by the sequence identifier 
(e.g. <210>1, <210>2, etc). The length, type of sequence (DNA, protein (PRT). etc) 

30 and source organism for each nucleotide or amino acid sequence are indicated by 
information provided in the numeric indicator fields <211>, <212> and <213>, 
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respectively. Nucleotide and amino acid sequences (SEQ ID NOs:) referred to in the 
specification are defined by the information provided in numeric indicator field <400> 
followed by the sequence identifier (eg, SEQ ID NO: 1 is <400>1, etc). 

5 The designation of nucleotide residues refen-ed to herein are those recommended by 
the lUPAC-lUB Biochemical Nomenclature Commission, wherein A represents 
Adenine, C represents Cytosine. G represents Guanine, T represents thymine, Y 
represents a pyrimidine residue, R represents a purine residue, M represents Adenine 
or Cytosine, K represents Guanine or Thymine, S represents Guanine or Cytosine, W 
10 represents Adenine or Thymine, H represents a nucleotide other than Guanine, B 
represents a nucleotide other than Adenine, V represents a nucleotide other than 
Thymine, D represents a nucleotide other than Cytosine and N represents any 
nucleotide residue. 

15 The designations for natural I y-occum'ng amino acid residues referred to herein are set 
forth in Table I, The designations for a non-limiting set of non-naturallyoccurring amino 
acids is listed in Table 2, 

As used herein the term "derived from" shall be taken to indicate that a specified 
20 integer rnay be obtained from a particular source albeit not necessarily directly from 
that source. 

Throughout this specification, unless the context requires otherwise, the word 
"comprise", or variations such as "comprises" or "comprising", will be understood to 
25 imply the inclusion of a stated step or element or integer or group of steps or elements 
or integers but not the exclusion of any other step or element or integer or group of 
steps or elements or integers. 
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TABLE 1 



Amino Acid Three-letter Code One-letter Code 



5 Alanine 


Ala 


A 


Arginine 


Arg 


R 


Asparagine 


Asn 


N 


Aspartic acid 


Asp 


n 


Cysteine 


Cys 


c 


10 Glutamine 


Gin 


o 


Glutamic acid 


Glu 


p 


Glycine 


Gly 


G 


Histidine 


His 


H 


Isoleucine 


He 


I 


15 Leucine 


Leu 


L 


Lysine 


Lys 


K 


Methionine 


Met 


. M 


Phenylalanine 


Phe 


17 


Proline 


Pro 


P 


20 Serine 


Ser 


S 


Threonine 


Thr 


T 


Tryptophan 


Trp 


W 


Tyrosine 


Tyr 


Y 


Valine 


Val 


V 


25 Aspartate/glutamate 


Baa 


B 


Asparagine/glutamine 






Any amino acid as above 


Xaa 


X 
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TABLE 2 



Non-conventional 
amino acid 



Code 



Non-conventional 
amino acid 



Code 



a-aminobutyric acid 

a-amino-a-methylbutyrate 

aminocyclopropane- 

carboxylate 
1 0 aminoisobutyric acid 

aminonorbornyl- 

carboxylatc 

cyclohexylalanine 

cyclopentylalanine 
1 5 D-alanine 

D-arginine 

D-aspartic acid 

D-cysteine 

D-glutamine 
20 D-glutamic acid 

D-histidine 

D-isoleucine 

D-leucine 

D-lysine 
25 D-methionine 

D-ornithine 

D-phenylalanine 

D-proline 

D-serine 
30 D-threonine 

D-tryptophan 



Abu 

Mgabu 

Cpro 



Aib 
Norb 



Chexa 

Cpen 

Dal 

Darg 

Dasp 

Dcys 

Dgln 

Dglu 

Dhis 

Dile 

Dleu 

Dlys 

Dmet 

Dorn 

Dphe 

Dpro 

Dser 

Dthr 

Dtrp 



L-N-methylalanine 

L-N-methylarginine 

L-N-methylasparagine 

L-N-methylaspartic acid 

L-N-methylcysteine 

L-N-methylglutamine 

L-N-methylglutamic acid 

L-N-methylhistidine 

L-N-methylisoIleucine 

L-N-methylleucine 

L-N-methyllysine 

L-N-methylmethionine 

L-N-methylnorleucine 

L-N-methylnorvaline 

L-N -methy lornithine 

L-N-methylphenylalanine 

L-N-methylprol ine 

L-N-methylserine 

L-N-methylthreonine 

L-N-methyltryptophan 

L-N-methyltyrosine 

L-N-methylvaline 

L- N-methy lethy 1 g 1 y c ine 

L-N-methyl-t-butylglycine 

L-norleucine 

L-norvaline 



Nmala 

Nmarg 

Nmasn 

Nmasp 

Nmcys 

Nmgln 

Nmglu 

Nmhis 

Nmile 

Nmleu 

Nmlys 

Nmmet 

Nmnle 

Nmnva 

Nmorn 

Nmphe 

Nmpro 

Nmser 

Nmthr 

Nmtrp 

Nmtyr 

Nmval 

Nmetg 

Nmtbug 

Nle 

Nva 
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Dtyr 
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a-methyNaminoisobutyrate 


Maib 


D-valine 


Dval 


a-methyl-y-aminobutyrate 


Mgabu 


D-a-methylalanine 


Dmala 


a-methylcyclohexylalanine 


Mchexa 


D-a-methylarginine 


Dmarg 


a-methyicylcopentylalanine 


Mcpen 


5 D-a-methylasparagine 


Dmasn 


a-methyl-a-napthylalanine 


Manap 


D-a-methylaspartate 


Dmasp 


a-methylpenicillamine 


Mpen 


D-a-methylcysteine 


Dmcys 


N-(4-aminobutyl)gIycine 


Nglu 


D-a-methylglutamine 


Dmgln 


N-(2-aminoethyl)glycine 


Naeg 


D-a-methylhistidine 


Dmhis 


N-(3 -aminopropy l)gly cine 


Norn 


10 D-a-methylisoleucine 


Dmile 


N-amino-a-methylbutyrate 


Nmaabu 


D-a-methyllcucine 


Dmieu 


a-napthylalanine 


Anap 


D-a-mcthyllysine 


Dmlys 


N-benzylglycine 


Nphe 
* 


D-a-methylmethionine 


Dmmet 


N-(2-carbamyiethyl)glycine 


Ngln 


D-a-methylorni thine 


Dmorn 


N-(carbamylmethyl)gIycine 


Nasn 


1 5 D-a-methylpheny lalanine 


Dmphe 


N-(2-carboxyethyl)glycine 


Nglu 


D-a-methylproline 


Dmpro 


N-(carboxymethy 1) glycine 


Nasp 


D-a-methylserine 


Dmser N-cyclobutylglycine 


Ncbut 


D-a-methylthreonine 


Dmthr 


N-cycloheptylglycine 


Nchep 


D-a-methyltryptophan 


Dmtrp 


N-cyclohexylgiycine 


Nchex 


20 D-a-methyityrosine 


Dmty 


N-cyclodecylglycine 


Ncdec 


D-a-methylvaline 


Dmval 


N-cylcododecylglycine 


Ncdod 


D-N-methylalanine 


Dnmala 


N-cyclooctylglycine 


Ncoct 


D-N-methylarginine 


Dnmarg 


N-cyclopropylglycine 


Ncpro 


D-N-methylasparagine 


Dnmasn 


N-cycloundecylglycine 


Ncund 


25 D-N-methylaspartate 


Dnmasp 


N-(2a-<iiphenyiethyl) 








glycine 


Nbhm 


D-N-methylcysteine 


Dnxncys 


N-(3,3-diphenylpropyl) 








glycine 


Nbhe 
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D-N-methylglutamine 



Dnmgln 



D-N-methylglutamate 
D-N-methylhistidine 
5 D-N-methylisoleucine 



Diunglu 
Dnmhis 
Dnmile 



D-N-methylleucine 



Dnmleu 



D-N-methyllysine 



Dnmlys 



10 N-methylcyclohexylalanine Nmchexa 



D-N-methylornithine 
N-methylglycine 
N-methylaminoisobutyrate 
N-( 1-methylpropyl) glycine 

15 N-(2-methylpropyl)glycine 
D -N-methy Itry p tophan 
D-N-methyltyrosine 
D-N-methylvaline 
Y-aminobutyric acid 

20 L-/-butylglycine 
L-ethylglycine 
L-homophenylalanine 
L-a-methylarginine 
L-a-methylaspartate 

25 L-a-methylcysteine 
L-a-methylglutamine 
L-a-methylhistidine 



L-a-methylisoleucine 



30 



Diunorn 

Nala 

Nmaib 

Nile 

Nleu 

Diimtrp 

Dnmtyr 

Dnmval 

Gabu 

Tbug 

Etg 

Hphe 

Marg 

Masp 

Mcys 

Mgln 

Mhis 



Mile 



L-a-methylleucine 



Mleu 



N-(3-guanidinopropy 1) 
glycine Narg 
N-( 1 -hydroxy ethyl)gly cine Nthr 
N-(hydroxyethyl))glycine Nser 
N-(imida20lylethyl)) 
glycine Nhis 
N-(3-indolylyethyl) 
glycine 

N-methyl-Y-aminobutyrate 
D-N-methylmethionine 
N-methylcyclopentylalanine 
D-N-methylphenylalanine 
D-N-methylproline 
D-N-methylserine 
D-N-methylthreonine 
N-(l-methylethyl)glycine 
N-methyla-napthylalanine 
N-methylpenicillamine 
N-(p-hydroxyphenyl)glycine 
N-(thiomethyl)glycine 
penicillamine 
L-a-methylalanine 
L-a-methylasparagine 
L-a-methyl-f-butylglycine 
L-methylethylglycine 
L-a-methylglutamate 
L- a -me thy 1 homo 
phenylalanine 
N-(2-methylthioethyl) 
glycine Nmet 
L-a-methyllysine Mlys 



Nhtrp 
Nmgabu 
Dnmmet 
Nmcpen 
Dnmphe 
Dnmpro 
Dnmser 
Dnmthr 
Nval 
Nmanap 
Nmpen 
Nhtyr 
Ncys 
Pen 
Mala 
Masn 
Mtbug 
Metg 



Mgiu 



Mhphe 
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L-a-methylmethionine Mmet 

L-a-methy Inorval ine Mnva 

L-a-methylphenylalanine Mphe 

L-a-methylserine Mser 

5 L-a-methyltryptophan Mtrp 

L-a-methylvaline Mval 

N-(N-(2,2-dipheiiylethyl) 

carbamylmethyOglycine Nnbhm 
10 l-carboxy-l-(2,2Kiiphenyl- 

cthylamino)cyclopropane Nmbc 



Those skilled in the art will appredate that the invention described herein is susceptible 
15 to variations and modifications other than those specifically described. It is to be 
understood that the invention includes all such variations and modifications. The 
invention also includes all of the steps, features, compositions and compounds 
referred to or indicated in this specification, individually or collectively, and any and all 
combinations or any two or more of said steps or features. 

20 

The present invention is not to be limited in scope by the specific embodiments 
described herein, which are intended for the purposes of exemplification only. 
Functionally-equivalent products, compositions and methods are cleariy within the 
scope of the invention, as described herein. 

25 

BACKGROUND TO THE INVENTION 

The biosynthesis of the starch granule is a complex process which involves the action 
of an array of isoforms of enzymes involved in the starch biosynthesis. Following the 
formation of glucose-1 -phosphate, the enzyme activities required for the synthesis of 
30 granular starch include ADP glucose pyrophosphorylase (EC 2.7.7.27), starch 
synthases (EC 2.4.1.21), branching enzymes (EC 2.4.1.18) and debranching enzymes 
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L-a-methy Inorleuc ine Mnle 

L-a-methylornithine Morn 

L-a-methy Iproline Mpro 

L-a-methylthreonine Mthr 

L-a-methyltyrosine Mtyr 
L-N-methy Ihomo 

phenylalanine Nmhphe 
N-(N-(3,3-diphenylpropyl) 

carbamylmethyl)giycine Nnbhe 
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(EC 3.2.1.41 and EC 3.2.1.68) (Mouille et aL, 1996). Plants contain isozymes of each 
of these activities, and the definition of these isoforms and their roles has been 
conducted through investigation of the properties of the suite of soluble enzymes found 
in the stroma of the plastid, analysis of the proteins entrapped within the matrix of the 
5 starch granule, and mutational studies to identify genes and define linkages between 
individual genes and their specific roles- 

Starch synthases extend regions of a-1 ,4 glucan through the transfer of the glucosyl 
moiety of ADPglucose to the non-reducing end of a pre-existing a>1,4 glucan. In 

10 addition to GBSS, 3 other classes of starch synthase have been identified in plants, 
SSI (wheat, Li et aL, 1999 and GenBank Accession No. U48227; rice, Baba et a/., 
1993; potato, Genbank Accession No. STSTASYNT), SSII (pea. Dry et ai 1992; 
potato, Edwards et aL, 1995; maize, Harn et ai 1998 and GenBank Accession No. 
U66377) and SSIII (potato, Abel ef a/., 1996; maize, Gao et aL, 1998), In the cereals, 

15 the most comprehensively studied species Is maize, where in addition to GBSS. 
cDNAs encoding SSI, SSI la, and SSI lb have been isolated, and both cDNA and 
genomic clones for da//1 have been characterised (Knight et ai, 1998; Harn et aL, 
1998; Gao ef a/., 1998). In maize, the product of the dul gene is known as maize 
SSII, however this gene is the homologue of potato SSIII. 

20 

The proteins within the matrix of the wheat starch granule have been extensively 
studied (Denyer ef a/., 1995; Rahman et aL, 1995; Takaoka et aL, 1997; Yamamori 
and Endo, 1996) and 60, 75. 85. 100, 104 and 105 kDa protein bands can be 
visualised following SDS-PAGE. The predominant 60 kDa protein is exclusively 

25 granule-bound and is analogous to the "waxy" granule bound starch synthase (GBSS) 
gene in maize (Rahman ef a/., 1995). The combination of three null alleles for this 
enzyme from each of the wheat genomes (Nakamura et aL, 1995) results in the 
amylose-free Vaxy" phenotype found in other species The 75 kDa starch synthase I 
(wSSI) is found in both the granule and the soluble fraction of wheat endosperm 

30 (Denyer et aL, 1995; Li et aL, 1999) and has been assigned to chromosomes 7A, 78 
and 70 (Yamamori and Endo, 1996; Li ef a/.. 1999). The 85 kDa band contains a 
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class II branching enzyme and an unidentified polypeptide (Rahman ef a/., 1995). The 
100. 104 and 105 kDa proteins of the wheat starch granule (designated Sgp-BI, Sgp- 
D1 and Sgp-AI by Yamamori and Endo, 1996) have been shown to be encoded by a 
homeologous set of genes on the short arm of chromosome 7B, 7A and 7D 
5 respectively (Yamamori and Endo, 1996; Takaoka etaL, 1997). Denyer et al. (1995) 
concluded on the basis of enzyme activity assays that these proteins were also starch 
synthases. These genes are referred to hereinafter as the "wheat SSI I genes". 

While GBSS has been established to be essential for amylose synthesis, the remaining 
10 starch synthases are thought to be primarily responsible for the elongation of 
amylopectin chains, although this does not preclude them from also having non- 
essential roles in amylose biosynthesis. Differences in kinetic properties between 
isoforms, and the analysis of mutants lacking various isoforms, suggests that each 
isoenzyme contributes to the extension of specific subsets of the available non- 
15 reducing ends. 

SUMMARY OF THE INVENTION 

The production of plants that produce improved starches that are modified for 
particular end-use applications, such as, for example, starches having high or low 
20 amyloseiamylopectin ratios, requires the availability of genes encoding the various 
starch synthase isoforms. Because of species-specific codon usages, and variations 
in the kinetic parameters of the starch synthase isoforms between species, the 
production of modified starches may require the use of genes derived from particular 
species. 

25 

Furthermore, the screening-assisted breeding of plants having desirable starch content 
and/or composition requires specific gene sequences to be provided that can be used 
to distinguish between different homeologous genes encoding the various isoforms of 
wheat starch synthases, such as. for example, to identify and distinguish between 
30 naturally-occurring variant gene sequences. It is a particular object of the present 
invention to provide gene sequences to facilitate the screening-assisted selection of 
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wheat plants having starch traits which are associated with the presence and/or 
expression of one or more wheat SSI and/or SSIII genes. 

Accordingly, the present invention provides isolated nucleotide sequences encoding 
5 the wheat SSII (i.e. wSSII) and wheat SSIII (i.e. wSSIII) isoenzymes, and DNA markers 
derived therefrom. The present invention further facilitates the production of 
transformed plants carrying these nucleotide sequences. 

More particularly, the present invention provides isolated nucleic acid molecules 
10 encoding the 100, 104 and 105 kDa SSII (Sgp-1) polypeptides of the wheat starch 
granule matrix, as detemnined using the SDS/PAGE system of Rahman et al. (1995), 
which polypeptides are equivalent to the 100, 108 and 115 kDa polypeptides described 
by Yamamori and Endo (1 996). 

15 The present invention further provides isolated nucleic acid molecules encoding the 
soluble dulh-type wheat starch synthase III polypeptide. Analysis of the polypeptides 
encoded by these nucleic acid molecules reveals several consensus amino acid 
sequence motifs that are highly conserved in wheat starch synthase isoenzymes, in 
addition to isoenzyme-specific sequences, which sequences possess utility in isolating 

20 related starch synthase-encoding sequences and in assaying plants for their 
expression of one or more starch synthase isoenzymes. 

Accordingly, one aspect of the present invention provides an isolated nucleic acid 
molecule which comprises a sequence of nucleotides which encodes, or is 
25 complementary to a nucleic acid molecule which encodes a wheat starch synthase 
polypeptide, protein or enzyme molecule or a functional subunit thereof selected from 
the following: 

(i) a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 
30 least about 85% identical overall to an amino acid sequence set forth in any one 

of SEQ ID NOS: 2, 4. or 6; 
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(ii) a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 
functional subunlt thereof which comprises an amino acid sequence which Is at 
least about 85% identical overall to an amino acid sequence set forth in any one 
of SEQ ID NOS: 8 or 10; 
5 (lii) a wheat starch synthase, polypeptide, protein or enzyme or functional 

subunit thereof which comprises a conserved amino acid sequence having at 
least 25% identity to an amino acid sequence selected from the group 
consisting of: 

(a) KVGGLGDWTS (SEQ ID NO: 39); 
10 (b) GHTVEVILPKY (SEQ ID NO: 40); 

(c) HDWSSAPVAWLYKEHY (SEQ ID NO: 41); 

(d) GILNGIDPDIWDPYTD (SEQ ID NO: 42); 

(e) DVPIVGIITRLTAQKG (SEQ ID NO: 43); 
(0 NGQWLLGSA (SEQ ID NO: 44); 

15 (g)AGSDFIIVPSIFEPCGLTQLVAMRYGS (SEQ ID NO: 45); and 

(h)TGGLVDTV (SEQ ID NO: 46); 
wherein said wheat starch synthase polypeptide further comprises an amino 
acid sequence having at least about 85% identity overall to an amino acid 
sequence set forth in any one of SEQ ID NOS: 2. 4, 6. 8 or 10; and 

20 (iv) a wheat starch synthase polypeptide, protein or enzyme or functional 

subunit thereof which comprises a conserved amino acid sequence having at 
least 25% identity to an amino acid sequence selected from the group 
consisting of: 

(a) KTGGLGDVAGA (SEQ ID NO: 47); 

25 (b) GHRVMVWPRY (SEQ ID NO: 48); 

(c) NDWHTALLPVYLKAYY (SEQ ID NO: 49); 

(d) GIVNGIDNMEWNPEVD (SEQ ID NO: 50); 

(e) DVPLLGFIGRLDGQKG (SEQ ID NO: 51 ); 

(f) DVQLVMLGTG (SEQ ID NO: 52); 

30 (g)AGADALLMPSRF(EA/)PGGLNQLYAMAYGT (SEQ ID NO: 53); and 

(h)VGG(V/L)RDTV (SEQ ID NO: 54); 
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wherein said wheat starch synthase polypeptide further comprises an amino 
acid sequence having at least about 85% identity overall to an amino acid 
sequence set forth in any one of SEQ ID NOS: 2, 4. 6. 8 or 1 0. 

5 In a preferred embodiment, the isolated nucleic acid molecule encodes a starch 
synthase polypeptide, protein or enzyme having at least about 90% amino acid 
sequence identity to any one of SEQ ID NOS: 2. 4. 6, 8 or 10, more preferably having 
at least about 95% or about 97% or about 99% identity to any one of said amino acid 
sequences. 

10 

In an alternative embodiment, the isolated nucleic acid molecule of the present 
invention encodes a wheat starch synthase polypeptide which comprises one or more 
amino acid sequences selected from the group consisting of: 

(a)GHTVEVILPKY; 
15 (b) HDWSSAPVAWLYKEHY; 

(c) DVPIVGIITRLTAQKG; 

(d) NGQWLLGSA; 

(e) AGSDFIIVPSIFEPCGLTQLVAMRYGS; 

(f) TGGLVDTV; 

20 (g) GIVNGIDNMEWNPEVD; and 

(h) AGADALLMPSRF(E/V)PCGLNQLYAMAYGT. 

in an alternative embodiment, the present invention provides an isolated nucleic acid 
molecule which encodes a wheat starch synthase polypeptide, protein or enzyme 
25 molecule or a functional subunit thereof, wherein said nucleic acid molecule comprises 
a nucleotide sequence having at least about 85% nucleotide sequence identity to any 
one of SEQ ID NOS: 1. 3, 5. 7. 9,11-16. 37 or 38 or a complementary nucleotide 
sequence. thereto. 

30 In a preferred embodiment, the isolated nucleic acid molecule comprises the 
nucleotide sequence set forth in any one of SEQ ID NOS: 1, 3. 5, 7, 9,11-16. 37 or 38, 
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or is at least about 90% identical, more preferably at least about 95% or 97% or 99% 
identical to all or a protein-encoding part thereof. 

In an alternative embodiment, the present invention provides an isolated nucleic acid 
5 molecule which encodes a wheat starch synthase polypeptide, protein or enzyme 
molecule or a functional subunit thereof, wherein said nucleic acid molecule comprises 
a nucleotide sequence that is capable of hybridising under at least moderate 
stringency hybridisation conditions to at least about 30 contiguous nucleotides derived 
from any one of SEQ ID NOS: 1, 3, 5, 7, 9,11-16, 37 or 38, or a complementary 
10 nucleotide sequence thereto, 

A second aspect of the present invention provides a method of isolating a nucleic acid 
molecule that encodes a starch synthase polypeptide, protein or enzyme described 
supra, said method comprising: 
15 (i) hybridising a probe or primer comprising at least about 15 contiguous 

nucleotides in length derived from any one of SEQ ID NOS: 1, 3. 5. 7, 9.1 1-16, 
37 or 38. or a complementary nucleotide sequence thereto to single-stranded 
or double-stranded mRNA, cDNA or genomic DNA; and 
(ii) detecting the hybridised mRNA, cDNA or genomic DNA using a detecting 
20 means. 

Preferably, the detecting means is a reporter molecule covalently attached to the probe 
or primer molecule or altematively, a polymerase chain reactio-i format. Accordingly, 
the present invention clearly extends to the use of the nucleic acid molecules provided 
25 herein to isolate related starch synthase-encoding sequences using standard 

"Si 

hybridisation and/or polymerase chain reaction techniques. 

A third aspect of the invention provides an isolated probe or primer comprising at least 
about 15 contiguous nucleotides in length derived from any one of SEQ ID NOS: 1 , 3, 
30 5, 7, 9.11-16. 37 or 38, or a complementary nucleotide sequence thereto. 
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Preferably, the probe or primer comprises a nucleotide sequence set forth in any one 
of SEQIDNOS:25 to 34. 

A fourth aspect of the present invention is directed to an isolated or recombinant starch 
5 synthase polypeptide, protein or enzyme, preferably substantially free of conspecific 
or non-specific proteins, which comprises an amino acid sequence selected from the 
following: 

(i) a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 

1 0 least about 85% identical overall to an amino acid sequence set forth in any one 

of SEQ IDNOS:2. 4. or 6; 

(ii) a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth in any one 

15 of SEQ ID NOS: 8or 10; 

(iii) a wheat starch synthase polypeptide, protein or enzyme or functional 
subunit thereof which comprises a conserved amino acid sequence having at 
least 25% identity to an amino acid sequence selected from the group 
consisting of: 

20 (a) KVGGLGDWTS; 

(b) GHWEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 
25 (f) NGQWLLGSA; 

(g) AGSDFIIVPSlFEPCGLTQLVAMRYGS; and 

(h) TGGLVDTV 

wherein said wheat starch synthase polypeptide further comprises an amino 
acid sequence having at least about 85% identity overall to an amino acid 
30 sequence set forth in any one of SEQ ID NOS: 2. 4, 6, 8 or 10; and 

(iv) a wheat starch synthase polypeptide, protein or enzyme or functional 
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subunit thereof which comprises a consen/ed amino acid sequence having at 
least 25% identity to an amino acid sequence selected from the group 
consisting of: 

(a) KTGGLGDVAGA; 
5 (b) GHRVMVWPRY; 

(c) NDWHTALLPVYLKAYY; 

(d) GIVNGIDNMEWNPEVD; 

(e) DVPLLGFIGRLDGQKG; 

(f) DVQLVMLGTG; 

10 (g)AGADALLMPSRF(EA/)PCGLNQLYAMAYGT; and 

(h)VGG(V/L)RDTV 

wherein said wheat starch synthase polypeptide further comprises an amino 
acid sequence having at least about 85% identity overall to an amino acid 
sequence set forth in any one of SEQ ID NOS: 2. 4, 6, 8 or 10. 

15 

The present invention clearly encompasses the mature protein region of a wheat 
starch synthase polypeptide which is obtained by removal of the N-terminal transit 
peptide sequence. 



20 A further aspect of the invention provides a method of assaying for the presence or 
absence of a starch synthase isoenzyme or the copy number of a gene encoding same 
in a plant, comprising contacting a biological sample derived from said plant with an 
isolated nucleic acid molecule derived from any one of SEQ ID NOS 1 , 3, 5, 7, 9,1 1- 
16, 37 or 38, or any one of SEQ ID NOS: 25 to 34, or a complementary nucleotide 

25 sequence thereto for a time and under conditions sufficient for hybridisation to occur 
and then detecting said hybridisation using a detection means. 

The detection means according to this aspect of the invention is any nucleic acid 
based hybridisation or amplification reaction. 

30 

A further aspect of the present invention utilises the above-mentioned assay method 



wo 00/66745 



PCT/AUOO/00385 



. 16- 

in the breeding and/or selection of plants which express or do not express particular 
starch synthase isoenzymes or alternatively, which express a particular starch 
synthase isoenzyme at a particular level in one or more plant tissues. This aspect 
clearly extends to the selection of transformed plant material which contains one or 
5 more of the isolated nucleic acid molecules of the present Invention. 

A further aspect of the present invention provides a method of modifying the starch 
content and/or starch composition of one or more tissues or organs of a plant, 
comprising expressing therein a sense molecule, antisense molecule, ribozyme 

10 molecule, co-suppression molecule, or gene-targeting molecule having at least about 
85% nucleotide sequence identity to any one of any one of SEQ ID NOS: 1 , 3, 5, 7, 
9,11-16, 37 or 38, or a complementary nucleotide sequence thereto for a time and 
under conditions sufficient for the enzyme activity of one or more starch synthase 
isoenzymes to be modified. This aspect of the invention clearly extends to the 

15 introduction of the sense molecule, antisense molecule, ribozyme molecule, co- 
suppression molecule, or gene-targeting molecule to isolated plant cells, tissues or 
organs or organelles by cell fusion or transgenic means and the regeneration of intact 
plants therefrom. 

20 A further aspect of the present invention provides an isolated promoter that is operable 
in the endosperm of a monocotyledonous plant cell, tissue or organ, and preferably in 
the endosperm of a monocotyledonous plant cell, tissue or organ. For example, the 
HMG promoter from wheat, or the maize zein gene promoter are particularly preferred, 
as is the promoter derived from a starch synthase gene of the present invention, such 

25 as a promoter that is linked in vivo to anyone of SEQ ID NOS 1, 3. 5. 7, 9,11-16, 37 
or 38, or a complementary nucleotide sequence thereto. 

A still further aspect of the present invention contemplates a transgenic plant 
comprising an introduced sense molecule, antisense molecule, ribozyme molecule, co- 
30 suppression molecule, or gene-targeting molecule having at least about 85% 
nucleotide sequence identity to anyone of any one of SEQ ID NOS: 1, 3, 5. 7. 9,11-16. 
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37 or 38, or a complementary nucleotide sequence thereto or a genetic construct 
comprising same, and to plant propagules. cells, tissues, organs or plant parts derived 
from said transgenic plant that also carry the introduced molecule(s). 

5 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a copy of a photographic representation showing the distribution of wheat 
endosperm starch synthases between the starch granule and soluble fractions. Lane 
1 , SDS-PAGE of wheat endosperm starch granule proteins revealed by silver staining; 
lanes 2-7. immunoblot of wheat endosperm soluble phase and starch granule proteins 

10 separated by SDS-PAGE from various developmental stages and probed with an anti- 
(wheat wSSII peptide) monoclonal antibody. Lanes 2-4 contain proteins from the 
soluble fraction of wheat endosperm at 15 days post anthesis (Lane 2); 20 days post 
anthesis (Lane 3); and at 25 days post anthesis (Lane 4). Lanes 5-7 contain proteins 
from the starch granule of wheat endosperm at 15 days post anthesis (Lane 5); 20 

15 days post anthesis (Lane 6); and at 25 days post anthesis (Lane 7). 

Figure 2 is a copy of a schematic representation comparing the nucleotide sequences 
of cDNA clones designated wSSIIA, wSSIIB and wSSIID, encoding the starch 
synthase II polypeptides from wheat, using the PI LEU P programme of Devereaux et 
20 a/. (1984). 

Figure 3 is a copy of a schematic representation comparing the deduced amino acid 
sequences of starch synthase II from wheat (wSSIIA. wSSIIB and wSSIID). maize 
(maize SSIIa and maize SSIIb; Ham etal., 1998), pea (pea SSII; Dry et al 1992) and 

25 potato (potato SSII; van der Leij et al., 1991). Identical amino acid residues among 
each of these sequences are indicated below the sequences with "*". The alignments 
of maize SSIIa with maize SSIIb, and pea SSII and potato SSII are essentially as 
described in Harn et al. (1998) and Edwards et al. (1995). All sequences are aligned 
to position the transit peptide cleavage site below the arrow (E) between residues 59 

30 and 60 of the wSSIlA sequence. The wSSIIpl sequence, the sequence of SGP-B1 
(peptides), and of eight conserved regions are annotated and underlined. 



wo 00/66745 



- 18- 

Figure 4 is a cx)py of a photographic representation of a northern blot showing the 
expression of wheat wSSII mRNA in wheat plants. Total RNAs were isolated from 
leaves pre-anthesis florets and endosperm of the wheat cultivar "Gabo", grown under 

o 

a photoperiod comprising 16 hours daylength, and at 18 C during the day, and at 
5 13 C during the night cycle, and probed with the wSSIIp2 DNA fragment. The source 
of each RNA is indicated at the top of the Figure as follows: Lane 1 , leaf; Lane 2, pre- 
anthesis florets; Lanes 3-11, endosperm at: 4 days post-anthesis (Lane 3); 6 days 
post-anthesis (Lane 4); 8 days post-anthesis (Lane 5); 10 days post-anthesis (Lane 
6); 12 days post-anthesis (Lane 7); 15 days post-anthesis (Lane 8); 18 days post- 
10 anthesis (Lane 9); 21 days post-anthesis (Lane 10); and 25 days post-anthesis (Lane 
11). 

Figure 5 is a copy of a photographic representation showing the localization of wheat 
starch synthase II genes on the wheat genome by PGR, using the primers ssllc, sslld 

15 and sslle in the amplification reaction. The nullisomic-tetrasomic genomic DNA of 
wheat cv. Chinese Spring was used as template DNA. Lane D, Triticum tauschii; Lane 
AB, Accession line N7DT7B having no 7D chromosome and four copies of the 78 
chromosome; Lane AD, Accession line N7BT7A having no 78 chromosome and four 
copies of the 7A chromosome; Lane BD, Accession line N7AT7B having no 7A 

20 chromosome and four copies of the 78 chromosome; Lane ABD, wheat cv. Chinese 
Spring. PCR products derived from each cDNA clone are labelled. The results indicate 
that the cDNA clones, wSSIIB. wSSIIA and wSSIID are derived from the B-, A- and D- 
genomes of wheat, respectively. 

25 Figure 6 is a schematic representation showing the organisation of introns (lines) and 
exons (boxes) in the wheat SSII gene shown in SEQ ID NO: 37. The scale (bases), 
relative to the nucleotide sequence set forth in SEQ ID NO: 37, is provided at the 
bottom of the figure. 

30 Figure 7 is a schematic representation comparing the deduced amino acid Sequences 
of the maize, potato and wheat SSI II polypeptides. 
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Figure 8 is a copy of a photographic representation showing the expression of wheat 
wSSIII mRNA in wheat. Total RNAs were isolated fronn the endosperm of the wheat 
cultivars Wyuna (Panel a) and Gabo (Panel b) leaves pre-anthesis florets and 
endosperm of the wheat ojltivar "Gabo", grown under a photoperiod comprising 16 
5 hours daylength, and at 18 C during the day cycle, and at 13 C during the night cycle, 
and probed with the wSSIIIpl DNA fragment derived from wSSIII.B3 cDNA. The 
source of each RNA is indicated at the top of the Figure as follows: Lane 1 , endosperm 
at: 4 days post-anthesis; Lane 2, endosperm at 6 days post-anthesis; Lane 4, 
endosperm at 8 days post-anthesis; Lane 4, endosperm at 10 days post-anthesis; 
10 Lane 5, endosperm at 12 days post-anthesis; Lane 6, endosperm at 15 days post- 
anthesis; Lane 7, endosperm at 18 days post-anthesis; Lane 8, endosperm at 21 days 
post-anthesis; Lane 9. endosperm at 25 days post-anthesis; and Lane 10, endosperm 
at 31 days post-anthesis (Panel a only). In panel (c), L refers to leaf RNA, and P refers 
to RNA from pre-anthesis florets derived from the cultivar Gabo. 

15 

Figure 9 is a schematic representation showing the position of conserved amino acid 
sequences within four wheat starch synthase proteins. The eight highly-conserved 
regions between the wheat starch synthase polypeptides are underiined and annotated 
at the top of each group of amino acid sequences. The sequences included in the 
20 alignment are the wheat SSII-A1 and wheat SSIII polypeptides of the present 
invention; wheat GBSS (wGBSS; Yan et al., 1 999); wheat SSI (wSS1 ; Li et ai, 1 999); 
wheat SSII (wSS2; SEQ ID NO: 4); and wheat SSIII (wSS3; SEQ ID NO: 8). 

Figure 10 is a schematic representation showing the relationships between the 
25 primary amino acid sequences of starch synthases (SS) and glycogen synthase of £. 
CO// (GS). The dendrogram was generated by the program PILEUP (Devereaux et al., 
1984). The amino acid sequences used for the analysis are those of the wheat SSIIA, 
wheat SSIIB. wheat SSIID, and wheat SSIII polypeptides of the present invention 
compared to the deduced amino add sequences of wheat GBSS (Clark a/. , 1 991 ), 
30 wheat SSI (Li ef a/., 1999). rice GBSS (Okagaki, 1992), rice SSI (Baba et al. 1993). 
maize GBSS (Kloesgen etai, 1986), maize SSI (Knighted a/.. 1998), maize SSlia and 
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maize SSIIb (Harn ef a/., 1998), maize SSIII (Gao ef a/., 1998). pea GBSS (Dry etal.. 
1992). pea SSII (Dry et ai, 1992). potato GBSS (van der Leij et ai, 1991). potato SSI 
(Genbank accession number: STSTASYNT), potato SSII (Edwards etal., 1995), potato 
SSIII (Abel etaL, 1996). and E. co// glycogen synthase (GS) (Kumar etaL, 1986). Five 
5 groups of enzymes Included in the alignment are granule-bound starch synthase 
(GBSS), starch synthase-l (SSI), starch synthase-ll (SSII), starch synthase-lll (SSIII) 
and glycogen synthase (GS). 

Figure 11 is a schematic representation showing the position of conserved regions 
10 within cereal starch synthase genes. Comparisons of cereal starch synthases were 
made based on their deduced amino acid sequences and 8 conserved regions 
identified. Conser/ed regions are shown in bold and transit peptides (where defined) 
in grey. The sequences included in the alignment are the wheat SSII-A1 and wheat 
SSIII polypeptides of the present invention; wheat GBSS (Ainsworth et aL, 1993); 
15 wheat SSI (Li et aL, 1999); maize SSIIa (Ham et al., 1998); and maize dull-1(Gao et 
aL, 1998). 

Figure 12 is a copy of a schematic representation of a gene map showing the 
alignment of fragments 1 to 6 of the genomic SSIII gene (lower line) with the 
20 corresponding SSIII cDNA clone (upper line). Raised regions in the genomic clone 
fragments (lower line) represent protein-encoding regions of the gene. 

Figure 13 is a schematic representation showing the organisation of introns (lines) and 
exons (boxes) in the wheat SSIII gene shown in SEQ ID NO: 38. The scale (bases), 
25 relative to the nucleotide sequence set forth in SEQ ID NO: 38, is provided at the 
bottom of the figure. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

30 One aspect of the present invention provides an isolated nucleic acid molecule which 
comprises a sequence of nucleotides which encodes, or is complementary to a nucleic 



1 
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acid molecule which encodes a wheat starch synthase polypeptide, protein or enzyme 
molecule or a functional subunit thereof selected from the following: 

(i) a wheat starch synthase 11 (wSSII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence set forth 

5 in any one of SEQ ID NOS: 2, 4, or 6; and 

(ii) a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence set forth 
in any one of SEQ ID NOS: 8 or 10. 

10 Alternatively or in addition, the isolated nucleic acid molecule of the present invention 
encodes a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 
functional subunit thereof and comprises a nucleotide sequence set forth in any one 
of SEQ ID NOS: 1 . 3. 5, or 37. 

15 Alternatively or in addition, the isolated nucleic acid molecule of the present invention 
encodes a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 
functional subunit thereof and comprises a nucleotide sequence set forth in any one 
of SEQ ID NOS: 7, 9. or 38. 

20 As used herein, the term "starch synthase" shall be taken to refer to any enzymatically- 
active peptide, polypeptide, oligopeptide, polypeptide, protein or enzyme molecule that 
is at least capable of transferring a glucosyl moiety from ADP-glucose to an a-1,4- 
glucan molecule, or a peptide, polypeptide, oligopeptide or polypeptide fragment of 
such an enzymatically-active molecule. 

25 

The term "wheat starch synthase" refers to a starch synthase derived from hexaploid 
wheat or bariey or a progenitor species, or a relative thereto such as the diploid 
Triticum tauschii or other diploid, tetraploid, aneuploid, polyploid, nullisomic, or a 
wheat/bariey addition line, amongst others, the only requirement that the genomic DNA 
30 is at least about 80% identical to the genome of a wheat plant as determined by 
standard DNA melting curve analyses. 
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The term "starch synthase II" or "wSSII" or similar term shall be taken to refer to a 
starch synthase as hereinbefore defined that is detectable in the starch granule of a 
plant seed endosperm and possesses one or more properties selected from the group 
consisting of; 

5 (i) it is immunologically cross-reactive with the wheat starch granule 

proteins designated Sgp-B1 and/or Sgp-D1 and/or Sgp-A1 . having estimated 
molecular weights of about 85 kDa to about 115 kDa; 

(ii) it is encoded by one of a homeologous set of genes localised on wheat 
chromosomes 7B or 7A or 7D; 
10 (iii) it is encoded by a nucleotide sequence that comprises at least about 1 5 

nucleotides in length derived from any one or more of SEQ ID NOS: 1, 3, 5, or 
37 or a complementary nucleotide sequence thereto; 

(iv) it is encoded by a nucleotide sequence that is at least about 85% 
identical to one or more of the nucleotide sequences set forth in SEQ ID NOS: 

15 1, 3, 5, or 37, or a complementary nucleotide sequence thereto; 

(v) it comprises an amino acid sequence having at least about 85% identity 
to one or more of SEQ ID NOS: 2 or 4 or 6; 

(vi) it comprises at least about 5 contiguous amino acids, preferably at least 
about 10 contiguous amino acids, more preferably at least about 1 5 contiguous 

20 amino acids, even more preferably at least about 20 contiguous amino acids 

and still even more preferably at least about 25-50 contiguous amino acids of 
the amino acid sequences set forth in SEQ ID NOS: 2 or 4 or 6; 

(vii) it which comprises a conserved amino acid sequence having at least 
25% identity to an amino acid sequence selected from the group consisting of: 

25 (a) KVGGLGDVVTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 
30 (f) NGQVVLLGSA; 

(g)AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 
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(h)TGGLVDTV. 
in addition to any one or more of (i) to (vi); and 

(viii) it which comprises a conserved amino acid sequence having at least 
25% identity to an amino acid sequence selected from the group consisting of: 
5 (a) KTGGLGDVAGA; 

(b) GHRVMNAA/PRY; 

(c) NDWHTALLPVYLKAYY; 

(d) GIVNGIDNMEWNPEVD; 

(e) DVPLLGFIGRLDGQKG; 
10 (f) DVQLVMLGTG; 

(g) AGADALLMPSRF(EA/)PCGLNQLYAMAYGT; and 

(h) VGG(V/L)RDTV. 

in addition to any one or more of (i) to (vi). 

15 The term "starch synthase III" or "wSSIir or similar term shall be taken to refer to a 
starch synthase as hereinbefore defined that possesses one or more properties 
selected from the group consisting of: 

(i) it is encoded by a nucleotide sequence that comprises at least about 1 5 
nucleotides in length derived from anyone or more of SEQ ID NOS: 7, 9. 11- 

20 1 6, or 38, or a complementary nucleotide sequence thereto; 

(ii) it is encoded by a nucleotide sequence that is at least about 85% 
identical to one or more of the nucleotide sequences set forth in SEQ ID NOS: 
7, 9, 11-16, or 38, or a complementary nucleotide sequence thereto; and 

(iii) it comprises an amino acid sequence having at least about 85% identity 
25 to one or more of SEQ ID NOS: 8 or 10; 

(iv) it comprises at least about 5 contiguous amino acids, preferably at least 
about 1 0 contiguous amino acids, more preferably at least about 1 5 contiguous 
amino acids, even more preferably at least about 20 contiguous amino acids 
and still even more preferably at least about 25-50 contiguous amino acids of 

30 the amino acid sequences set forth in SEQ ID NOS: 8 or 10- 

(v) which comprises a conserved amino acid sequence having at least 25% 
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identity to an amino acid sequence selected from the group consisting of: 

(a) KVGGLGDNAATS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 
5 (d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG: 

(f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAIV1RYGS; and 

(h) TGGLVDTV 

10 in addition to any one or more of (i) to (iv); and 

(vi) it which comprises a conserved amino acid sequence having at least 
25% identity to an amino acid sequence selected from the group consisting of: 

(a) KTGGLGDVAGA; 

(b) GHRVMVWPRY; 

15 (c) NDWHTALLPVYLKAYY; 

(d) GIVNGIDNMEWNPEVD; 

(e) DVPLLGFIGRLDGQKG; 

(f) DVQLVMLGTG; 

(g) AGADALLMPSRF(EA/)PCGLNQLYAMAYGT; and 
20 (h)VGG(V/L)RDTV, 

in addition to any one or more of (i) to (iv). 

In a more preferred embodiment, the WSSII or WSSIII polypeptide encoded by the 
nucleic acid molecule of the present invention will comprise a substantial contiguous 
25 region of any one of SEQ ID NOS: 2, 4, 6, 8 or 10 or 17 sufficient to possess the 
biological activity of a starch synthase polypeptide. 

For the purposes of nomenclature, the nucleotide sequence set forth in SEQ ID NO: 
1 relates to the cDNA molecule encoding the WSSII (i.e. Sgp-B1) polypeptide of 
30 wheat. The amino acid sequence of the corresponding polypeptide is set forth herein 
as SEQ ID N0:2. The nucleotide sequence set forth in SEQ ID NO: 3 relates to the 
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cDNA molecule encoding the WSSII (i.e. Sgp-AI) polypeptide of wheat. The amino 
acid sequence of the corresponding polypeptide is set forth herein as SEQ ID NO:4. 
The nucleotide sequence set forth in SEQ ID NO: 5 relates to the cDNA molecule 
encoding the WSSII (i.e. Sgp-D1) polypeptide of wheat. The amino acid sequence of 
5 the corresponding polypeptide is set forth herein as SEQ ID NO:6. The nucleotide 
sequences set forth in SEQ ID NOs: 7 and 9 relate, respectively, to full-length and 
partial cDNA molecules encoding the WSSIII polypeptide of wheat. The amino acid 
sequences of the con-esponding polypeptides are set forth herein as SEQ ID NOS: 8 
and 10, respectively. The nucleotide sequences set forth in SEQ ID NOs: 11 to 16 
10 relates to fragments of the genomic gene encoding the WSSIII polypeptide of wheat, ' 
significant protein-encoding regions of which are described by reference to Table 4 
and Figure 1 1 . The nucleotide sequence set forth in SEQ ID NO: 37 relates to the 
WSSII genomic gene of Triticum tauschii, corresponding to the WSSII gene of the D- 
genome of wheat, which encodes the WSSIII polypeptide. The nucleotide sequence 
15 set forth in SEQ ID NO: 38 relates to the wheat WSSIII genomic gene. 

Preferably, the isolated nucleic acid molecule of the present invention comprises a 
sequence of nucleotides which encodes, or Is complementary to a nucleic acid 
molecule which encodes a wheat starch synthase polypeptide, protein or enzyme 
20 molecule or a functional subunit thereof which comprises an amino acid sequence 
which is at least about 85% identical overall to an amino acid sequence set forth in any 
one of SEQ ID NOS: 2. 4. 6. 8. or 10 and more preferably, which additionally 
comprises which comprises one or more amino acid sequences selected from the 
group consisting of: 

25 (a) KVGGLGDVVTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 
30 (f) NGQWLLGSA; 

(g)AGSDFIIVPSIFEPCGLTQLVAMRYGS; 
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(h) TGGLVDTV; 

(i) KTGGLGDVAGA; 
0) GHRVMVWPRY; 

(k) NDWHTALLPVYLKAYY; 
5 (I) GIVNGIDNMEWNPEVD; 

(m) DVPLLGFIGRLDGQKG; 
(n) DVQLVMLGTG; 

(o)AGADALLMPSRF(EA/)PCGLNQLYAMAYGT; and 
(p)VGG(V/L)RDTV. 

10 

The present invention clearly extends to homologues, analogues and derivatives of the 
wheat starch synthase 11 and III genes exemplified by the nucleotide sequences set 
forth herein as SEQ ID NOs: 1, 3, 5, 7, 9,11-16, 37 or 38. 

15 Preferred starch synthase genes may be derived from a naturally-occurring starch 
synthase gene by standard recombinant techniques. Generally, a starch synthase 
gene may be subjected to mutagenesis to produpe single or multiple nucleotide 
substitutions, deletions and/or additions. Nucleotide insertional derivatives of the 
starch synthase gene of the present invention include 5' and 3' terminal fusions as 

20 well as intra-sequence insertions of single or multiple nucleotides. Insertional 
nucleotide sequence variants are those in which one or more nucleotides are 
introduced into a predetermined site in the nucleotide sequence although random 
insertion is also possible with suitable screening of the resulting product. Deletional 
variants are characterised by the removal of one or more nucleotides from the 

25 sequence. Substitutional nucleotide variants are. those in which at least one nucleotide 
in the sequence has been removed and a different nucleotide inserted in its place. 
Such a substitution may be '^silent" in that the substitution does not change the amino 
acid defined by the codon. Alternatively, substituents are designed to alter one amino 
acid for another similar acting amino acid, or amino acid of like charge, polarity, or 

30 hydrophobicity. 

For the present purpose, "homologues" of a nucleotide sequence shall be taken to 
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refer to an isolated nucleic add molecule which is substantially the same as the nucleic 
acid molecule of the present invention or its complementary nucleotide sequence 
notwithstanding the occurrence within said sequence, of one or more nucleotide 
substitutions, insertions, deletions, or rearrangements. 



5 



"Analogues" of a nucleotide sequence set forth herein shall be taken to refer to an 
Isolated nucleic acid molecule which is substantially the same as a nucleic acid 
molecule of the present invention or its complementary nucleotide sequence, 
notwithstanding the occurrence of any non-nucleotide constituents not normally 
10 present in said isolated nucleic acid molecule, for example carbohydrates, 
radiochemicals including radionucleotides. reporter molecules such as. but not limited 
to DIG. alkaline phosphatase or horseradish peroxidase, amongst others. 

"Derivatives" of a nucleotide sequence set forth herein shall be taken to refer to any 
15 isolated nucleic acid molecule which contains significant sequence similarity to said 
sequence or a part thereof. Generally, the nucleotide sequence of the present 
invention may be subjected to mutagenesis to produce single or multiple nucleotide 
substitutions, deletions and/or Insertions. Nucleotide insertional derivatives of the 
nucleotide sequence of the present invention include 5' and 3' terminal fusions as well 
20 as intra-sequence insertions of single or multiple nucleotides or nucleotide analogues. 
Insertional nucleotide sequence variants are those in which one or more nucleotides 
or nucleotide analogues are introduced into a predetermined site in the nucleotide 
sequence of said sequence, although random insertion Is also possible with suitable 
screening of the resulting product being performed. Deletional variants are 
25 characterised by the removal of one or more nucleotides from, the nucleotide 
sequence. Substitutional nucleotide variants are those in which at least one nucleotide 
in the sequence has been removed and a different nucleotide or nucleotide analogue 
inserted in its place. 

30 The present invention extends to the isolated nucleic acid molecule when integrated 
into the genome of a cell as an addition to the endogenous cellular complement of 
starch synthase genes, irrespective of whether or not the introduced nucleotide 
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sequence is translatable or non-translatable to produce a polypeptide. The present 
invention clearly contemplates the introduction of additional copies of starch synthase 
genes into plants, particularly wheat plants, in the antisense orientation to reduce the 
expression of particular wheat starch synthase genes. As will be known to those skilled 
5 in the art. such antisense genes are non-translatable, notwithstanding that they can be 
expressed to produce antisense mRNA molecules. 

The said integrated nucleic acid molecule may, or may not, contain promoter 
sequences to regulate expression of the subject genetic sequence. 

10 

Accordingly, the present invention dearly encompasses preferred homologues. 
analogues and derivatives that comprise a sequence of nucleotides which encodes, 
or is complementary to a nucleic acid molecule which encodes a wheat starch 
synthase polypeptide, protein or enzyme molecule or a functional subunit thereof 
15 selected from the following: 

(i) a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth in any one 
ofSEQIDNOS:2. 4. or6; 
20 (ii) a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 

functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth in any one 
orSEQ ID NOS: 8or 10; 

(iii) a wheat starch synthase polypeptide, protein or enzyme or functional 
25 subunit thereof which comprises a conserved amino acid sequence having at 

least 25% identity to an amino acid sequence selected from the group 
. consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

30 (c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 
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10 



(e) DVPIVGIITRLTAQKG; 

(f) NGQVVLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS;and 

(h) TGGLVDtV 

and wherein said wheat starch synthase polypeptide further comprises an 
amino acid sequence having at least about 85% identity overall to an amino 
acid sequence set forth in any one of SEQ ID NOS: 2. 4, 6. 8 or 1 0; and 
(iv) a wheat starch synthase polypeptide, protein or enzyme or functional 
subunit thereof which comprises a conserved amino acid sequence having at 
least 25% identity to an amino acid sequence selected from the group 
consisting of: 

(a) KTGGLGDVAGA: 

(b) GHRVMWVPRY; 

(c) NDWHTALLPVYLKAYY; 
15 (d) GIVNGIDNMEWNPEVD; 

(e) DVPLLGFIGRLDGQKG; 

• ♦ 

(f) DVQLVMLGTG; 

(g) AGADALLMPSRF(EA/)PCGLNQLYAMAYGT; and 

(h) VGG(V/L)RDTV. 

20 and wherein said wheat starch synthase polypeptide further comprises an 

amino acid sequence having at least about 85% identity overall to an amino 
acid sequence set forth in any one of SEQ ID NOS: 2, 4, 6, 8 or 10, 

Preferably, the isolated nucleic acid molecule encodes a starch syrUhase polypeptide, 
25 protein or enzyme that comprises two, more preferably three, more preferably four, 
more preferably five, more preferably six. more preferably seven and even more 
preferably eight of the conserved amino acid motifs listed supra. Even more preferably, 
the said amino acid motifs are located in a relative configuration such as that shown 
for the wheat SSII or wheat SSIII polypeptides described herein. 

30 



In a preferred embodiment, the isolated nucleic acid molecule encodes a starch 
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synthase polypeptide, protein or enzyme having at least about 90% amino acid 
sequence identity to any one of SEQ ID NOS: 2. 4. 6. 8 or 10, more preferably having 
at least about 95% or about 97% or about 99% identity to any one of said amino acid 
sequences. 

5 

In an alternative embodiment, the present invention provides an isolated nucleic acid 
molecule which encodes a wheat starch synthase polypeptide, protein or enzyme 
molecule or a functional subunit thereof, wherein said nucleic acid molecule comprises 
a nucleotide sequence having at least about 85% nucleotide sequence identity to any 
10 one of SEQ ID NOS: 1, 3, 5. 7. 9.11-16. 37. or 38. or a degenerate nucleotide 
sequence thereto or a complementary nucleotide sequence thereto. 

By "degenerate nucleotide sequence" is meant a nucleotide sequence that encodes 
a substantially identical amino acid sequence as a stated nucleotide sequence, 

15 

In a preferred embodiment, the isolated nucleic acid molecule comprises the 
nucleotide sequence set forth in any one of SEQ ID NOS: 1, 3, 5, 7. 9,11-16, 37. or 38. 
or is at least about 90% identical, more preferably at least about 95% or 97% or 99% 
identical to all or a protein-encoding part thereof. 

20 

In an alternative embodiment, preferred homologues, analogues and derivatives of the 
nucleic acid molecule of the present invention encodes a wheat starch synthase 
polypeptide, protein or enzyme molecule or a functional subunit thereof and comprises 
a nucleotide sequence that is capable of hybridising under at least moderate 
25 stringency hybridisation conditions to at least about 30 contiguous nucleotides derived 
from any one of SEQ ID NOS: 1. 3, 5, 7, 9,11-16, 37. or 38, or a complementary 
nucleotide sequence thereto. 

For the purposes of defining the level of stringency, a low stringency is defined herein 
30 as being a hybridisation and/or a wash carried out in BxSSC buffer. 0.1% (w/v) SOS 
at 28°C. Generally, the stringency is increased by reducing the concentration of SSC 
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buffer, and/or increasing the concentration of SDS and/or increasing the temperature 
of the hybridisation and/or wash. A moderate stringency comprises a hybridisation 
and/or a wash carried out in 0.2 x SSC-2 x SSC buffer. 0.1% (w/v) SDS at 42 °C to 
65^C, while a high stringency comprises a hybridisation and/or a wash carried out in 
5 0.1xSSC-0.2 X SSC buffer, 0.1% (w/v) SDS at a temperature of at least 55°C. 
Conditions for hybridisations and washes are well understood by one normally skilled 
in the art. For the purposes of further clarification only, reference to the parameters 
affecting hybridisation between nucleic acid molecules is found in pages 2.10.8 to 
2.10.16. of Ausubel et al. (1987), which is herein incorporated by reference. 

10 

Those skilled in the art will be aware of procedures for the isolation of further wheat 
starch synthase genes to those specifically described herein or homologues, 
analogues or derivatives of said genes, for example further cDNA sequences and 
genomic gene equivalents, when provided with one or more of the nucleotide 

15 sequences set forth in SEQ ID NOs: 1, 3, 5, 7, 9,11-16, 37, or 38. In particular, 
amplifications and/or hybridisations may be performed using one or more nucleic acid 
primers or hybridisation probes comprising at least 10 contiguous nucleotides and 
preferably at least about 20 contiguous nucleotides or 50 contiguous nucleotides 
derived from the nucleotide sequences set forth herein, to isolate cDNA clones, mRNA 

20 molecules, genomic clones from a genomic library (in particular genomic clones 
containing the entire 5' upstream region of the gene including the promoter sequence, 
and the entire coding region and 3'-untranslated sequences), and/or synthetic 
oligonucleotide molecules, amongst others. The present invention clearly extends to 
such related sequences. 

25 

Accordingly, a second aspect of the present invention provides a method of isolating 
a nucleic acid molecule that encodes a starch synthase polypeptide, protein or enzyme 
said method comprising: 

(i) hybridising a probe or primer comprising at least about 1 5 contiguous 
30 nucleotides in length derived from any one of SEQ ID NOS 1, 3. 5. 7, 9,11-16, 

37, or 38, or a complementary nucleotide sequence thereto to single-stranded 

or double-stranded mRNA. cDNA or genomic DNA; and 
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(ii) detecting the hybridised mRNA, cDNA or genomic DNA using a detecting 
means. 

Preferably, the detecting means is a reporter nrKDiecule covalently attached to the probe 
5 or primer molecule or alternatively, a polymerase chain reaction format. 

An alternative method contemplated in the present invention involves hybridising two 
nucleic add "primer molecules" to a nucleic acid "template molecule" which comprises 
a related starch synthase gene or related starch synthase genetic sequence or a 

10 functional part thereof, wherein the first of said primers comprises contiguous 
nucleotides derived from any one or more of SEQ ID NOS: 1, 3, 5, 7, 9.1 1-16, 37, or 
38, and the second of said primers comprises contiguous nucleotides complementary 
to any one or more of SEQ ID NOS: 1, 3, 5, 7, 9,11-16. 37. or 38. Specific nucleic 
acid molecule copies of the template molecule are amplified enzymatically in a 

15 polymerase chain reaction, a technique that is well known to one skilled in the art. 

In a preferred embodiment, each nucleic acid primer molecule is at least 10 
nucleotides in length, more preferably at least 20 nucleotides in length, even more 
preferably at least 30 nucleotides in length, still more preferably at least 40 nucleotides 
20 in length and even still more preferably at least 50 nucleotides in length. 

Furthermore, the nucleic acid primer molecules consists of a combination of any of the 
nucleotides adenine, cytidine, guanine, thymidine, or inosine. or functional analogues 
or derivatives thereof which are at least capable of being incorporated into a 
25 polynucleotide molecule without having an inhibitory effect on the hybridisation of said 
primer to the template molecule in the environment in which it is used. 

Furthermore, one or both of the nucleic acid primer molecules may be contained in an 
aqueous mixture of other nucleic acid primer molecules, for example a mixture of 
30 degenerate primer sequences which vary from each other by one or more nucleotide 
substitutions or deletions. Alternatively, one or both of the nucleic acid primer 
molecules may be in a substantially pure form. 
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The nucleic acid template molecule may be in a recombinant form, in a virus particle, 
bacteriophage particle, yeast cell, animal cell, or a plant cell. Preferably, the nucleic 
acid template molecule is derived from a plant cell, tissue or organ, in particular a cell, 
tissue or organ derived from a wheat or barley plant or a progenitor species, or a 
5 relative thereto such as the diploid Triticum tauschii or other diploid, tetraploid. 
aneuploid. polyploid, nullisomic. or a wheat/barley addition line, amongst others. 

Those skilled in the art will be aware that there are many known variations of the basic 
polymerase chain reaction procedure, which may be employed to isolate' a related 
10 starch synthase gene or related starch synthase genetic sequence when provided with 
the nucleotide sequences set forth herein. Such variations are discussed, for example, 
in McPherson et al (1991). The present invention extends to the use of all such 
variations in the isolation of related starch synthase genes or related starch synthase 
genetic sequences using the nucleotide sequences embodied by the present invention. 

15 

As exemplified herein, the present inventors have isolated several wheat starch 
synthase genes using both hybridisation and polymerase chain reaction approaches, 
employing novel probes and primer sequences to do so. 

20 Accordingly, a third aspect of the invention provides an isolated probe or primer 
comprising at least about 1 5 contiguous nucleotides in length derived from any one of 
SEQ ID NOS: 1, 3. 5. 7, 9.11-16. 37. or 38. or a complementary nucleotide sequence 
thereto. 



25 Preferably, the probe or primer comprises a nucleotide sequence set forth in any one 
of SEQ ID NOS: 25 to 34. 



The isolated nucleic acid molecule of the present invention may be introduced into and 
expressed in any cell, for example a plant cell, fungal cell, insect cell, animal cell, yeast 
30 cell or bacterial cell. Those skilled in the art will be aware of any modifications which 
are required to the codon usage or promoter sequences or other regulatory 
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sequences, in order for expression to occur in such cells. 

A further aspect of the invention provides a method of assaying for the presence or 
absence of a starch synthase isoenzyme or the copy number of a gene encoding same 
5 in a plant, comprising contacting a biological sample derived from said plant with an 
isolated nucleic acid molecule derived from any one of SEQ ID NOS 1, 3. 5, 7 g 1 1_ 
16. 37. or 38. or any one of SEQ ID NOS: 25 to 34. or a complementary nucleotide 
sequence thereto for a time and under conditions suffident for hybridisation to occur 
and then detecting said hybridisation using a detection means. 

10 

The detection means according to this aspect of the invention is any nucleic acid 
based hybridisation or amplification reaction. 

The hexaploid nature of wheat prevents the straightfon^/ard identification of starch 
15 synthase allelic variants by hybridisation using the complete starch synthase-encoding 
sequence, because the similarities between the various alleles generally results in 
significant cross-hybridisation. Accordingly, sequence-specific hybridisation probes are 
required to distinguish between tiie various alleles. Similariy. wherein PGR is used to 
amplify specific allelic variants of a starch synthase gene, one or more sequence- 
20 specific amplification primers are generally required. As will be apparent from the 
amino acid sequence comparisons provided herein, such as in Figures 3 and 13. non- 
conserved regions of particular wheat starch synthase polypeptides are particulariy 
useful for the design of probes and primers that are capable of distinguishing between 
one or more starch synthase polypeptide isoenzyme or allelic variant. The present 
25 invention cleariy contemplates the design of such probes and primers based upon the 
sequence comparisons provided herein. 

In the perfomaance of this embodiment of the present invention, the present inventors 
particulariy contemplate the identification of wheat starch synthase null alleles or 
30 alternatively, mutations wherein specific amino acids are inserted or deleted or 
substituted, compared to one or more of the wheat SSII or SSIII alleles disclosed 
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herein. Such null alleles and other allelic variants are readily identifiable using PGR 
screening which employs amplification primers based upon the nucleotide and amino 
acid sequences disclosed herein for SSII and/or SSIII. Once identified, the various 
mutations can be stacked or pyramided into one or more new wheat lines, such as by 
5 introgression and/or standard plant breeding and/or recombinant approaches (eg. 
transformation, transfectlon, etc) thereby producing a novel germplasm which exhibits 
altered starch properties compared to existing lines. DNA markers based upon the 
nucleotide and amino acid sequences disclosed herein for SSII and/or SSIII can be 
employed to monitor the stacking of genes Into the new lines and to con-elate the 
10 presence of particular genes with starch phenotypes of said lines. 

In this regard, a significant advantage conferred by the present invention Is the design 
of new DNA markers that reveal polymorphisms such as, for example, length 
polymorphisms, restriction site polynrorphisms. and single nucleotide polymorphisms. 
15 amongst others, between wheat starch synthases and, in particular, between wheat 
GBSS and/or SSI and/or SSII and/or SSIII, or between allelic variants of one or more 
of said starch synthases, that can be used to identify the three genomes of hexaploid 
wheats (i.e., the A, B and D genomes). 

20 Preferably, such DNA markers are derived from the intron region of a starch synthase 
gene disclosed herein, more preferably the wheat SSII and/or the wheat SSIII gene. 
Those skilled in the art will be aware that such regions generally have a higher degree 
of variation than in the protein-encoding regions and, as a consequence, are 
particujariy useful in identifying specific allelic variants of a particular gene, such as 

25 allelic variants contained in any one of the three wheat genomes, or alternatively or in 
addition, for the purpose of distinguishing between wheat GBSS, SSI, SSII or SSIII 
genes. 



A further approach contemplated by the present inventors is the design of unique 
30 isoenzyme-specific and/or allele-specific peptides based upon the amino acid 
sequence disclosed herein as SEQ ID NOS: 25 and/or SEQ ID NO: 4 and/or SEQ ID 
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, NO: 6 and/or SEQ ID NO: 8 and/or SEQ ID NO: 10, which peptides are then used to 
produce polyclonal or monoclonal antibodies by conventional means. Alternatively, the 
genes encoding these polypeptides or unique peptide regions thereof can be 
introduced in an expressible format into an appropriate prokaryotic or eukaryotic 

5 expression system, where they can be expressed to produce the isoenzyme-specific 
and/or allele-specific peptides for antibody production. Such antibodies may also be 
used as markers for the purpose of both identifying parental lines and germplasms and 
monitoring the stacking of genes in new lines, using conventional immunoassays such 
as, for example, ELISA and western blotting. 

10 

A further aspect of the present invention utilises the above-mentioned nucleic acid 
based assay method in the breeding and/or selection of plants which express or do not 
express particular starch synthase isoenzymes or alternatively, which express a 
particular starch synthase isoenzyme at a particular level in one or more plant tissues. 
15 This aspect clearly extends to the selection of transformed plant material which 
contains one or more of the isolated nucleic acid molecules of the present invention. 

Yet another aspect of the present invention provides for the expression of the nucleic 
acid molecule of the present invention in a suitable host (e.g. a prokaryote or 
20 eukaryote) to produce full length or non-full length recombinant starch synthase gene 
products. 

Hereinafter the term "starch synthase gene product" shall be taken to refer to a 
recombinant product of a starch synthase gene of the present invention. 

25 

Preferably, the recombinant starch synthase gene product comprises an amino acid 
sequence having the catalytic activity of a starch synthase polypeptide or a functional 
mutant, derivative part, fragment, or analogue thereof. 

30 In a particularly preferred embodiment of the invention, the recombinant starch 
synthase gene product is selected from the following: 
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(i) a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth in any one 
ofSEQIDNOS:2,4,or6; 

5 (ii) a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 

functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth in any one 
of SEQ ID NOS: 8 or 10; and 

(iii) a wheat starch synthase polypeptide, protein or enzyme or functional 
10 subunit thereof which comprises a conserved amino acid sequence having at 

least 25% identity to an amino acid sequence selected from the group 
consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY: 

15 (c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 
(0 NGQVVLLGSA; 

(g)AGSDFIIVPSIFEPCGLTQLVAMRYGS; 

20 (h)TGGLVDTV; (i) a wheat starch synthase II (wSSlI) 

polypeptide, protein or enzyme or functional subunit 
thereof which comprises an amino acid sequence 
which is at least about 85% identical overall to an 
amino acid sequence set forth in any one of SEQ 

25 IDNOS:2, 4. or6; 

(ii) a wheat starch synthase 111 (wSSIII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth in any one 
of SEQ ID NOS: 8 or 10; 

30 (ill) a wheat starch synthase polypeptide, protein or enzyme or functional 

subunit thereof which comprises a conserved amino acid sequence having at 
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least 25% identity to. an amino acid sequence selected from the group 
consisting of: 

(a) KVGGLGDVVTS; 

(b) GHTVEVILPKY; 

5 (c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 
10 (h)TGGLVDTV; 

(i) KTGGLGDVAGA; 
G) GHRVMWVPRY; 
(k) NDWHTALLPVYLKAYY; 
(I) GIVNGIDNMEWNPEVD; 
15, (m) DVPLLGFIGRLDGQKG; 

(n) DVQLVMLGTG; 

(o)AGADALLMPSRF(EA/)PCGLNQLYAMAYGT; and 
{p)VGG(V/L)RDTV. 

20 Accordingly, the present invention clearly extends to homologues, analogues and 
derivatives of the amino acid sequences set forth herein as SEQ ID NOS: 2, 4, 6, 8 
and 10. 

In the present context, "homologues" of an amino acid sequence refer to those 
25 polypeptides, enzymes or proteins which have a similar catalytic activity to the amino 
acid sequences described herein, notwithstanding any amino acid substitutions, 
additions or deletions thereto. A homologue may be isolated or derived from the same 
or another plant species as the species from which the polypeptides of the invention 
are derived. 

30 

"Analogues" encompass polypeptides of the invention notwithstanding the occurrence 
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of any non-naturally occurring amino acid analogues therein. 

"Derivatives" include modified peptides in which ligands are attached to one or more 
of the amino acid residues contained therein, such as carbohydrates, enzymes, 
5 proteins, polypeptides or reporter molecules such as radionuclides or fluorescent 
compounds. Glycosylated, fluorescent, acylated or alkylated forms of the subject 
peptides are particularly contemplated by the present invention. Additionally, 
derivatives of an amino acid sequence described herein which comprises fragments 
or parts of the subject amino acid sequences are within the scope of the invention, as 
10 are homopolymers or heteropolymers comprising two or more copies of the subject 
polypeptides. Procedures for derivatizing peptides are well-known in the art. 

Substitutions encompass amino acid alterations in which an amino acid is replaced 
with a different naturally-occurring or a non-conventional amino acid residue. Such 
15 substitutions may be classified as "conservative*', in which an amino acid residue 
contained in a starch synthase gene product is replaced with another naturally- 
occurring amino acid of similar character, for example Giy^Ala, Val-^-^-lle^Leu, 
Asp^Glu. Lys^Arg, Asn^GIn or Phe<-*Trp<->Tyr. 

20 Substitutions encompassed by the present invention may also be "non-conservative", 
in which an amino acid residue which is present in a starch synthase gene product 
described herein is substituted with an amino acid with different properties, such as a 
naturally-occurring amino acid from a different group (eg. substituted a charged or 
hydrophobic amino acid with alanine), or alternatively, in which a naturally-occurring 

25 amino acid is substituted with a non-conventional amino acid. 

I- 

Non-conventional amino acids encompassed by the invention include, but are not 
limited to those listed in Table 2. 

30 Amino acid substitutions are typically of single residues, but may be of multiple 
residues, either clustered or dispersed. 
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Amino acid deletions will usually be of the order of about 1-10 amino acid residues, 
wliile insertions may be of any length. Deletions and insertions may be made to the 
N-terminus, the C-terminus or be internal deletions or insertions. Generally, insertions 
within the amino acid sequence will be smaller than amino- or carboxy-terminal fusions 
5 and of the order of 1-4 amino acid residues, 

A homologue, analogue or derivative of a starch synthase gene product as referred to 
herein may readily be made using peptide synthetic techniques well-known in the art. 
such as solid phase peptide synthesis and the like, or by recombinant DNA 
10 manipulations. Techniques for making substituent mutations at pre-determlned sites 
using recombinant DNA technology, for example by Ml 3 mutagenesis, are also well- 
known. The manipulation of nucleic acid molecules to produce variant peptides, 
polypeptides or proteins which manifest as substitutions, insertions or deletions are 
well-known in the art. 

15 

The starch synthase gene products described herein may be derivatized further by the 
inclusion or attachment thereto of a protective group which prevents, inhibits or slows 
proteolytic or cellular degradative processes. Such deriyatization may be useful where 
the half-life of the subject polypeptide is required to be extended, for example to 

20 increase the amount of starch produced in the endosperm or alternatively, to increase 
the amount of protein produced in a bacterial or eukaryotic expression system. 
Examples of chemical groups suitable for this purpose include, but are not limited to, 
any of the non-conventional amino acid residues listed in Table 2, in particular a D- 
stereoisomer or a methylated form of a naturally-occurring amino acid listed in Table 

25 1 . Additional chemical groups which are useful for this purpose are selected from the 
list comprising aryl or heterocyclic N-acyl substituents, polyalkylene oxide moieties, 
desulphatohirudin muteins. alpha-muteins, alpha-aminophosphonic acids, water- 
soluble polymer groups such as polyethylene glycol attached to sugar residues using 
hydrazone or oxime groups, benzodiazepine dione derivatives, glycosyl groups such 

30 as beta-glycosyiamine or a derivative thereof, isocyanate conjugated to a polyol 
functional group or polyoxyethylene polyol capped with diisocyanate, amongst others. 
Similarly, a starch synthase gene product or a homologue. analogue or derivative 
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thereof may be cross-linked or fused to itself or to a protease inhibitor peptide, to 
reduce susceptibility of said molecule to proteolysis. 

In a particularly preferred embodiment, the percentage similarity to In any one of SEQ 
5 ID NOS: 2, 4, 6, 8 or 10 is at least about 90%, more preferably at least about 95%. 
even more preferably at least about 97% and even more preferably at least about 
98%, or about 99% or 100%. 

In a related embodiment, the present invention provides a "sequencabiy pure" form of 
10 the amino acid sequence described herein. "Sequencabiy pure" is hereinbefore 
described as substantially homogeneous to facilitate amino acid determination. 

In a further related embodiment, the present invention provides a "substantially 
homogeneous" form of the subject amino acid sequence, wherein the term 

15 "substantially homogeneous" is hereinbefore defined as being in a form suitable for 
interaction with an immunologically interactive molecule. Preferably, the polypeptide 
is at least 20% homogeneous, more preferably at least 50% homogeneous, still more 
preferably at least 75% homogeneous and yet still more preferably at least about 95- 
1 00% homogenous, in terms of activity per microgram of total protein in the protein 

20 preparation. 

To produce the recombinant polypeptide of the present invention, the coding region 
of a starch synthase gene described herein or a functional homologue. analogue or 
derivative thereof is placed operably in connection with a promoter sequence in the 
25 sense orientation, such that a starch synthase gene product is capable of being 
expressed under the control of said promoter sequence. 

In the present context, the term "in operable connection with" means that expression 
of the isolated nucleotide sequence is under the control of the promoter sequence with 
30 which it is connected, regardless of the relative physical distance of the sequences 
from each other or their relative orientation with respect to each other. 
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Reference herein to a "promoter'' is to be taken in its broadest context and includes the 
transcriptional regulatory sequences of a classical genomic gene, including the TATA 
box which is required for accurate transcription initiation, with or without a CCAAT box 
sequence and additional regulatory elements (i.e. upstream activating sequences, 
5 enhancers and silencers) which alter gene expression in response to developmental 
and/or external stimuli, or in a tissue-specific manner, A promoter is usually, but not 
necessarily, positioned upstream or 5', of a structural gene, the expression of which 
it regulates. Furthermore, the regulatory elements comprising a promoter are usually 
positioned within 2 kb of the start site of transcription of the gene. 

10 

In the present context, the term "promoter" is also used to describe a synthetic or 
fusion molecule, or derivative which confers, activates or enhances expression of a 
structural gene or other nucleic acid molecule, particularly in a plant cell and more 
preferably in a wheat plant or other monocotyledonous plant cell, tissue or organ. 
15 Preferred promoters may contain additional copies of one or more specific regulatory 
elements, to further enhance expression and/or to alter the spatial expression and/or 
temporal expression. For example, regulatory elements which confer copper inducibility 
may be placed adjacent to a heterologous promoter sequence, thereby conferring 
copper inducibility on the expression of said molecule. 

20 

Those skilled in the art will be aware that in order to obtain optimum expression of the 
starch synthase gene of the present invention, it is necessary to position said gene in 
an appropriate configuration such that expression is controlled by the promoter 
sequence. Promoters are generally positioned 5' (upstream) to the genes that they 

25 control. In the construction of heterologous promoter/structural gene combinations it 
is generally preferred to position the promoter at a distance from the gene transcription 
start site that is approximately the sarne as the distance between that promoter and 
the gene it controls in its natural setting, i.e.. the gene from which the promoter is 
derived. As is known in the art. some variation in this distance can be accommodated 

30 without loss of promoter function. Similarly, the preferred positioning of a regulatory 
sequence element with respect to a heterologous gene to be placed under its control 
is defined by the positioning of the element in its natural setting, i.e., the genes from 
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which it is derived. Again, as is known in the art, some variation in this distance can 
also occur 

Examples of promoters suitable for expressing the starch synthase gene of the present 
5 invention include viral, fungal, bacterial, animal and plant derived promoters capable 
of functioning in prokaryotic or eukaryotic cells. Preferred promoters are those capable 
of regulating the expression of the subject starch synthase genes in plants cells, fungal 
cells, insect cells, yeast cells, animal cells or bacterial cells, amongst others. 
Particularly preferred promoters are capable of regulating expression of the subject 
10 nucleic acid molecules in monocotyledonous plant cells. The promoter may regulate 
the expression of the said molecule constitutively. or differentially with respect to the 
tissue in which expression occurs or. with respect to the developmental stage at which 
expression occurs, or in response to external stimuli such as physiological stresses, 
or plant pathogens, or metal ions, amongst others. 

15 

Accordingly, strong constitutive promoters are particularly preferred for the purposes 
of the present invention. 

Examples of preferred promoters include the bacteriophage T7 promoter, 
20 bacteriophage T3 promoter, SP6 promoter, lac operator-promoter, tac promoter, SV40 
late promoter, SV40 early promoter. RSV-LTR promoter, CMV IE promoter, CaMV 35S 
promoter, SCSV promoter, SCBV promoter and the like. 

Particularly preferred promoters operable in plant cells include, for example the CaMV 
25 35S promoter, and the SCBV promoter. Those skilled in the art will readily be aware 
of additional promoter sequences other than those specifically described. 

In a particularly preferred embodiment, the promoter may be derived from a genomic 
starch synthase gene. Preferably, the promoter sequence comprises nucleotide 
30 sequences that are linked in vivo to nucleotide sequences set forth in any one of SEQ 
ID NOs: 1. 3. 5. 7, 9.11-16, 37, or 38. By "linked in wVo" means that the promoter is 
present in its native state in the genome of a wheat plant v/here it controls expression 
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of the starch synthase gene of the present invention. 

Conveniently, genetic constructs are employed to facilitate expression of a starch 
synthase genetic sequence of the present invention or a functional derivative, part, 
5 homologue, or analogue thereof. To produce a genetic constmct. the starch synthase 
gene of the invention is Inserted into a suitable vector or episome molecule, such as 
a bacteriophage vector, viral vector or a plasmid. cosmid or artificial chromosome 
vector which is capable of being maintained and/or replicated and/or expressed in the 
host cell, tissue or organ into which it is subsequently introduced. The said genetic 
10 construct comprises the subject nucleic acid molecule placed operably under the 
control of a promoter sequence and optionally, a terminator sequence. 

The term "terminator" refers to a DNA sequence at the end of a transcriptional unit 
which signals termination of transcription. Terminators are 3 '-non-translated DNA 
15 sequences containing a polyadenylation signal, which facilitates the addition of 
polyadenylate sequences to the 3'-end of a primary transcript. Terminators active in 
bacteria, yeasts, animal cells and plant cells are known and described in the literature. 
They may be isolated from bacteria, fungi, viruses, animals and/or plants. 

20 Examples of terminators particularly suitable for use in expressing the nucleic acid 
molecule of the present invention in plant cells include the nopaline synthase (NOS) 
gene terminator of Agrobacterium tumefaciens, the terminator of the Cauliflower 
mosaic virus (CaMV) 35S gene, and the zein gene terminator from Zee mays. 

25 Genetic constructs will generally further comprise one or more origins of replication 
and/or selectable marker gene sequences. 

The origin of replication can be functional in a bacterial cell and comprise, for example, 
the pUC or the C0IEI origin. Alternatively, the origin of replication is operable in a 
30 eukaryotic cell, tissue and more preferably comprises the 2 micron {2/j,m) origin of 
replication or the SV40 origin of replication. 
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As used herein, the term "selectable marker gene" Includes any gene which confers 
a phenotype on a cell in which it is expressed to facilitate the identification and/or 
selection of cells which are transfected or transformed with a genetic construct of the 
invention or a derivative thereof. 

5 

Suitable selectable marker genes contemplated herein include the ampicillin-resistance 
gene (Amp"), tetracycline-resistance gene (To"), bacterial kanamycin-resistance gene 
(Kan'), is the zeocin resistance gene (Zeocin is a drug of bleomycin family which is 
trademark of InVitrogen Corporation), the AURI-C gene which confers resistance to the 

10 antibiotic aureobasidin A, phosphinothricin-resistance gene, neomycin 
phosphotransferase gene (npfll), hygromycin-resistance gene, p-glucuronidase (GUS) 
gene, chloramphenicol acetyltransferase (CAT) gene, green fluorescent protein- 
encoding gene or the luciferase gene, amongst others. Those skilled in the art will be 
aware of other selectable marker genes useful in the performance of the present 

15 invention and the subject invention is not limited by the nature of the selectable marker 
gene. 

Usually, an origin of replication or a selectable marker gene suitable for use in bacteria 
is physically-separated from those genetic sequences contained in the genetic 
20 construct which are intended to be expressed or transferred to a eukaryotic cell, or 
integrated into the genome of a eukaryotic cell. 

Standard methods can be used to introduce genetic constructs into a cell, tissue or 
organ for the purposes of modulating gene expression. Particularly prererred methods 

25 suited to the introduction of synthetic genes and genetic constructs comprising same 
to eukaryotic cells include liposome-mediated transfection or transformation, 
transformation of cells with attenuated virus particles or bacterial cells and standard 
procedures for the transformation of plant and animal cells, tissues, organs or 
organisms. Any standard means may be used for their introduction including cell 

30 mating, transformation or transfection procedures known to those skilled in the art or 
described byAusubel et al, (1992). 
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In a further embodiment of the present invention, the starch synthase genes of the 
present invention and genetic constructs comprising same are adapted for integration 
into the genome of a cell in which it is expressed. Those skilled in the art will be aware 
that, in order to achieve integration of a genetic sequence or genetic construct into the 
5 genome of a host cell, certain additional genetic sequences may be required. In the 
case of plants, left and right border sequences from the T-DNA of the Agrobacterium 
tumefaciens Ti plasmid will generally be required. 

The invention further contemplates increased starch and/or modified starch 
10 composition in transgenic plants expressing the nucleic acid molecule of the invention 
in the sense orientation such that the activity of one or more starch synthase 
isoenzymes is increased therein. By increasing the level of one or more starch 
synthase isoenzymes, the deposition of starch in the amyloplast or chloroplast is 
increased and/or a modified starch granule structure is produced and/or starch 
15 composition is modified and/or the amylose/amylopectin ratio is altered in the plant. 

Wherein it is desired to increase the synthesis of a particular starch synthase 
isoenzyme in a plant cell, the coding region of a starch synthase gene is placed 
operably behind a promoter, in the sense orientation, such that said starch synthase 
20 is expressed under the control of said promoter sequence. In a preferred embodiment, 
the starch synthase genetic sequence is a starch synthase genomic sequence, cDNA 
molecule or protein-coding sequence. 

Wherein it is desirable to reduce the level of a particular starch synthase isoenzyme 
25 in a plant cell, the nucleic acid molecule of the present invention can be expressed in 
the antisense orientation, as an antisense molecule or a ribozyme molecule, under the 
control of a suitable promoter. 

Alternatively, the nucleic acid molecule of the present invention may also be expressed 
30 in the sense orientation, in the form of a co-suppression molecule, to reduce the level 
of a particular starch synthase isoenzyme in a plant cell. As will be known to those 
skilled in the art, co-suppression molecules that comprise inverted repeat sequences 
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of a target nucleic acid molecule provide optimum efficiency at reducing expression of 
said target nucleic acid molecule and. as a consequence, the present invention clearly 
contemplates the use of inverted repeat sequences of any one or more of the starch 
synthase genetic sequences exemplified herein, or inverted repeat sequences of a 
5 homologue, analogue or derivative of said starch synthase genetic sequences, to 
reduce the level of a starch synthase isoenzyme in a plant. 

The expression of an antisense, ribozyme or co-suppression molecule comprising a 
starch synthase gene in a cell such as a plant cell, fungal cell, insect cell, .animal cell. 

10 yeast cell or bacterial cell, may also increase the availability of carbon as a precursor 
for a secondary metabolite other than starch (e.g. sucrose or cellulose). By targeting 
the endogenous starch synthase gene, expression is diminished, reduced or otherwise 
lowered to a level that results In reduced deposition of starch in the amyloplast or 
chloroplast and/or leads to modified starch granule structure and/or composition 

15 and/or altered amylose/amylopectin ratio. 

Accordingly, a further aspect of the present invention provides a method of modifying 
the starch content and/or starch composition of one or more tissues or organs of a 
plant, comprising expressing therein a sense molecule, antisense molecule, ribozyme 

20 molecule, co-suppression molecule, or gene-targeting molecule having at least about 
85% nucleotide sequence identity to any one of any one of SEQ ID NOS: 1,3,5,7. 
9.11-16, 37, or 38, or a complementary nucleotide sequence thereto for a time and 
under conditions sufficient for the enzyme activity of one or more starch synthase 
isoenzymes to be modified. This aspect of the invention clearly extends to the 

25 introduction of the sense molecule, antisense molecule, ribozyme molecule, co- 
suppression molecule, or gene-targeting molecule to isolated plant cells, tissues or 
organs or organelles by cell fusion or transgenic means and the regeneration of intact 
plants therefrom. 

30 Co-suppression is the reduction in expression of an endogenous gene that occurs 
when one or more copies of said gene, or one or more copies of a substantially similar 
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gene are introduced into the cell, preferably in the form of an inverted repeat structure. 

The present inventors have discovered that the genetic sequences disclosed herein 
are capable of being used to modify the level of starch when expressed, particularly 
5 when expressed in plants cells. Accordingly, the present invention clearly extends to 
the modification of starch biosynthesis in plants, in particular wheat or barley plants or 
a progenitor plant species, or a relative thereto such as the diploid Thticum tauschii 
or other diploid, tetraploid, aneuploid, polyploid, nullisomic, or a wheat/barley addition 
line, amongst others. 

In particular, the present invention contemplates decreased starch production and/or 
modified starch composition in transgenic plants expressing the nucleic acid molecule 
of the invention in the antisense orientation or alteratively, expressing a ribozyme or 
co-suppression molecule comprising the nucleic acid sequence of the invention such 
15 that the activity of one or more starch synthase Isoenzymes is decreased therein. 



In the context of the present invention, an antisense molecule is an RNA molecule 
20 which is transcribed from the complementary strand of a nuclear gene to that which is 
normally transcribed to produce a "sense" mRNA molecule capable of being translated 
into a starch synthase polypeptide. The antisense molecule is therefore 
complementary to the mRNA transcribed from a sense starch synthase gene or a part 
thereof. Although not limiting the mode of action of the antisense molecules of the 
25 present invention to any specific mechanism, the antisense RNA molecule possesses 
the capacity to form a double-stranded mRNA by base pairing with the sense mRNA, 
which may prevent translation of the sense mRNA and subsequent synthesis of a 
polypeptide gene product. 

30 Ribozymes are synthetic RNA molecules which comprise a hybridising region 
complementary to two regions, each of at least 5 contiguous nucleotide bases in the 
target sense mRNA. In addition, ribozymes possess highly specific endoribonuclease 
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activity, which autocatalytically cleaves the target sense mRNA. A complete 
description of the function of ribozymes is presented by Haseloff and Gerlach (1988) 
and contained in International Patent Application No. WO89/05852. 

5 The present invention extends to ribo2yme which target a sense mRNA encoding a 
native starch synthase gene product, thereby hybridising to said sense mRNA and 
cleaving it, such that it is no longer capable of being translated to synthesise a 
functional polypeptide product. 

10 According to this embodiment, the present Invention provides a ribozyme or antisense 
molecule comprising at least 5 contiguous nucleotide bases derived from any one of 
SEQ ID NOS: 1. 3. 5, 7, 9,11-16. 37, or 38, or a complementary nucleotide sequence 
thereto or a homologue, analogue or derivative thereof, wherein said antisense or 
ribozyme molecule is able to form a hydrogen-bonded complex with a sense mRNA 

15 encoding a starch synthase gene product to reduce translation thereof. 

In a preferred embodiment, the antisense or ribozyme molecule comprises at least 10 
to 20 contiguous nucleotides derived from any one of SEQ ID NOS: 1 , 3. 5, 7, 9,1 1-16, 
37. or 38, or a complementary nucleotide sequence thereto or a homologue, analogue 
20 or derivative thereof. Although the preferred antisense and/or ribozyme molecules 
hybridise to at least about 10 to 20 nucleotides of the target molecule, the present 
invention extends to molecules capable of hybridising to at least about 50-100 
nucleotide bases in length, or a molecule capable of hybridising to a full-length or 
substantially full-length mRNA encoded by a starch synthase gene. 

25 

Those skilled in the art will be aware of the necessary conditions, if any. for selecting 
or preparing the antisense or ribozyme molecules of the invention. 

It is understood in the art that certain modifications, including nucleotide substitutions 
30 amongst others, may be made to the antisense and/or ribozyme molecules of the 
present invention, v/ithout destroying the efficacy of said molecules in inhibiting the 
expression of a starch synthase gene. It is therefore within the scope of the present 
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invention to include any nucleotide sequence variants, homologues. analogues, or 
fragments of the said gene encoding same, the only requirement being that said 
nucleotide sequence variant, when transcribed, produces an antisense and/or 
ribozyme molecule which is capable of hybridising to a sense mRNA molecule which 
5 encodes a starch synthase gene product 

Gene targeting is the replacement of an endogenous gene sequence within a cell by 
a related DNA sequence to which it hybridises, thereby altering the form and/or 
function of the endogenous gene and the subsequent phenotype of the cell. According 

10 to this embodiment, at least a part of the DNA sequence defined by any one of SEQ 
ID NOS: 1 , 3, 5, 7, 9.1 1-16, 37, or 38 may be introduced into target cells containing an 
endogenous gene that encodes a particular starch synthase isoenzyme, thereby 
replacing said endogenous gene. According to this embodiment, the polypeptide 
product of the gene targetting molecule generally encodes a starch synthase 

15 isoenzyme that possesses different catalytic activity to the polypeptide product of the 
endogenous gene, producing in turn modified starch content and/or composition in the 
target cell. 

The present invention extends to genetic constructs designed to facilitate expression 
20 of a sense molecule, an antisense molecule, ribozyme molecule, co-suppression 
molecule, or gene targeting molecule of the present invention. The requirements for 
expressing such molecules are similar to those for expressing a recombinant 
polypeptide as described supra. 



25 The present invention further extends to the production and use of starches and 
proteins produced using the novel genes described herein. Modified starches 
produced by plants which have been selected using marker-assisted selection, or 
alternatively, produced by transgenic plants carrying the introduced starch synthase 
genes, are particularly suitable for use in food products, such as, for example, flour 

30 and flour-based products, in particular those products selected from the group 
consisting of: flour-based sauce; leavened bread; unleavened bread; pasta, noodle; 
cereal; snack food; cake; and pastry. Modified proteins are also suitable for use in non- 
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food products, such as. for example, those non-food products selected from the group 
consisting of: films; coatings; adhesives; building materials; and packaging materials. 

Additionally, starch hydrolysates or undegraded starches are both useful in industry 
5 and, as a consequence, the present invention is useful in applications relating to the 
use of both starch hydrolysates and undegraded starches. By "starch hydrolysates" is 
meant the glucose and glucan components that are obtainable by the enzymatic or 
chemical degradation of starch in chemical modifications and processes, such as 
fermentation. 

10 

Starch produced by plants expressing the sense, antisense, co-suppression, gene- 
targetting or ribozyme molecules of the present invention may exhibit modified 
viscosities and/or gelling properties of its glues when compared to starch derived from 
wild-type plants. Native starches produced by the performance of the inventive method 

15 are useful as an additive in the following: (i) foodstuffs, for the purpose of increasing 
the viscosity or gelling properties of food; (ii) in non-foodstuffs, such as an adjuvant or 
additive in the paper and cardboard industries, for retention or as a size filler, or as a 
solidifying substance or for dehydration, or film coating, amongst others; (iii) in the 
adhesive industry as pure starch glue, as an additive to synthetic resins and polymer 

20 dispersions, or as an extenders fpr synthetic adhesives; (iv) in the textile and textile 
care industries to strengthen woven products and reduce burring or to thicken dye 
pastes; (v) in the building industry, such as a binding agent in the production of 
gypsum plaster boards, or for the deceleration of the sizing process; (vi) in ground 
stabilization or for the temporary protection of ground particles against water in artificial 

25 earth shifting; (vii) as a wetting agent in plant protectants and fertilizers; (viii) as a 
binding agent in dnjgs. pharmaceuticals and medicated foodstuff such as vitamins, etc; 
(ix) as an additive in coal and briquettes; (xi) as a flocculent in the processing of coal 
ore and slurries; (xii) as a binding agent in casting processes to increase flow 
resistance and improve binding strength; and (xiii) to improve the technical and optical 

30 quality of rubber and plastic products. Additional applications are not excluded. 

A further aspect of the present invention provides an isolated promoter that is operable 
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in the endosperm of a monocotyledonous plant cell, tissue or organ, and preferably in 
the endosperm of a monocotyledonous plant cell, tissue or organ. According to this 
embodiment, it is preferred that the promoter is derived from a starch synthase gene 
of the present invention, such as a promoter that is linked in vivo to any one of SEQ 
5 ID NOS: 1. 3, 5, 7. 9,11-16, 37, or 38, or a complementary nucleotide sequence 
thereto. 

In a particularly preferred embodiment, the promoter comprises a nucleotide sequence 
derivable from the 5'-upstream region of SEQ ID NO: 1 1 or SEQ ID NO: 37 or SEQ ID 

10 NO: 38, or a complementary nucleotide sequence thereto, an more preferably 
comprises nucleotides 1 to about 287 of SEQ ID NO: 11, or nucleotides 1 to about 
1416 of SEQ ID NO: 37, or nucleotides 1 to about 973 of SEQ ID NO: 38, or a 
complementary nucleotide sequence thereto. The present invention clearly extends 
to promoter sequences that comprise further nucleotide sequences in the region 

15 upstream of the stated nucleotide sequence that are linked in vivo to said nucleotide 
sequence in the wheat genome. 

In a related embodiment, the promoter sequence of the present invention will further 
comprise an exon sequence derived from a starch synthase gene, such as, for 

20 example, an intron I sequence described herein, or a complementary nucleotide 
sequence thereto. Those skilled in the art will be aware that the inclusion of such 
nucleotide sequences may increase the expression of a heterologous structural gene, 
the expression of which is controlled thereby. Preferred intron I sequences include, 
for example, nucleotide sequences in the region of about position 1744 to about 1847 

25 of SEQ ID NO: 37, and/or about position 1 1 00 to about position 2056 of SEQ ID NO: 
38. Additional sequences comprising intron/exon junction boundary sequences which 
are readily determined by those skilled in the art are not excluded. 

The present invention further extends to the expression of any structural gene operably 
30 under the control of the starch synthase promoter sequence exemplified herein or a 
functional homologue, analogue or derivative of said promoter sequence. 
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As with other embodiments described herein for expression in cells, a genetic 
construct may be employed to effect said expression and the present invention clearly 
extends to said genetic constructs. 

5 The polypeptide encoded by the structural gene component may be a reporter 
molecule which is encoded by a gene such as the bacterial P-glucuronidase gene or 
chloramphenicol acetyltransferase gene or alternatively, the firefly luciferase gene. 
Alternatively, wherein it is desirable to alter carbon partitioning within the endosperm, 
the polypeptide may be an enzyme of the starch sucrose biosynthetic pathways. 
10 Preferably, the promoter sequence is used to regulate the expression of one or more 
of the starch synthase genes of the present invention or a sense, antisense. ribozyme. 
co-suppression or gene-targetting molecule comprising or derived from same. 

Recombinant DNA molecules carrying the aforesaid nucleic acid molecule of the 

15 present invention or a sense, antisense, ribozyme, gene-targetting or co-suppression 
molecule and/or genetic construct comprising same, may be introduced into plant 
tissue, thereby producing a "transgenic plant", by various techniques known to those 
skilled in the art. The technique used for a given plant species or specific type of plant 
tissue depends on the known successful techniques. Means for introducing 

20 recombinant DNA into plant tissue include, but are not limited to. transformation 
(Paszkowski ef a/., 1984). electroporation (Fromm etal., 1985). or microinjection of the 
DNA (Crossway et aL, 1 986), or T-DNA-mediated transfer from Agrobactenum to the 
plant tissue. Representative T-DNA vector systems are described in the following 
references: An ef a/.(1985); Herrera-Estrella et al, (1983a. b); Hen-er?.-Estrella et al. 

25 (1985). Once introduced into the plant tissue, the expression of the introduced gene 
may be assayed in a transient expression system, or it may be determined after 
selection for stable integration within the plant genome. Techniques are known for the 
in vitro culture of plant tissue, and in a number of cases, for regeneration into whole 
plants. Procedures for transferring the introduced gene from the originally transformed 

30 plant into commercially useful cultivars are known to those skilled in the art. 

In general, plants are regenerated from transformed plant cells or tissues or organs on 
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hormone-containing media and the regenerated plants may take a variety of forms, 
such as chimeras of transformed cells and non-transformed cells; clonal transformants 
(e.g.. all cells transfomried to contain the expression cassette); grafts of transformed 
and untransformed tissues (e.g... a transfomied root stock grafted to an untransformed 
5 scion in citms species). Transfomned plants may be propagated by a variety of means, 
such as by clonal propagation or classical breeding techniques. For example, a first 
generation (or T1) transfomied plants may be selfed to give homozygous second 
generation (or T2) transfonned plants, and the T2 plants further propagated through 
classical breeding techniques. 

10 

Accordingly, a still further aspect of the present invention contemplates a transgenic 
plant comprising an introduced sense molecule, antisense molecule, ribozyme 
molecule, co-suppression molecule, or gene-targeting molecule having at least about 
85% nucleotide sequence identity to any one of any one of SEQ ID NOS: 1 , 3. 5, 7, 
15 9.11-16. 37. or 38, or a complementary nucleotide sequence thereto or a genetic 
constoict comprising same. The present invention further extends to those plant parts, 
propagules and progeny of said transgenic plant or derived therefrom, the only 
requirement being that said propagules and progeny also carry the introduced nucleic 
acid molecule(s). 

20 

The present invention is further described by reference to the following non-limiting 
examples. 

EXAMPLE 1 
Plant material 

25 Genetic stocks of hexaploid bread wheat Triticum aestivum cv. Chinese Spring with 
various chromosome additions and deletions were kindly supplied by Dr E. Lagudah 
(CSIRO Plant Industry, Canberra) and derived from stocks described in Sears and 
Miller (1985). The hexaploid {Triticum aestivum) wheats cv Gabo and cv Wyuna were 
grown in controlled growth cabinet conditions (18°C day and 13 C night, with a 

30 photoperiod of 16 h). Wheat leaves and florets prior to anthesis, and endosperm were 
collected over the grain filling period, immediately frozen in liquid nitrogen and stored 
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at -80°C until required. 



EXAMPLE 2 

Gel Electrophoresis, Antibodies and Immunoblotting 

5 Monoclonal antibodies against the Sgp-1 proteins, and their use in the immunoblotting 
of SDS-PAGE gels have been described previously (Rahman ef a/., 1995). 



EXAMPLE 3 
Preparation of total RNA from wheat 

10 Total RNA was isolated from the leaf, floret and endosperm tissues of wheat 
essentially as described by Higgins et sL (1976) or Rahman et aL (1998), RNA was 
quantified by UV absorption and by separation in 1.4% (w/v) agarose-formaldehyde 
gels which were then visualised under UV light after staining with ethidium bromide. 



15 EXAMPLE 4 

Construction and screening of cDNA libraries 

A first cDNA library, an expression cDNA library of wheat endosperm, was constructed 
from mRNA isolated from wheat cy Chinese Spring. RNA from 5. 7, 9, 1 1 and 1 3 days 
after anthesis was pooled and random primers were used for the first strand of cDNA 
20 synthesis. Monoclonal antibodies against 100 -105 kDa proteins in wheat starch 
granules (Rahman ef a/., 1995) were used for immunoscreening of the expression 
cDNA library. 

A second cDNA library was constructed from the endosperm mRNA of the hexaploid 
25 Triticum aestivum cultivar Wyuna. 8-12 days after anthesis. as described by Rahman 
et aL (1997). This library was screened with a 85-bp cDNA fragment, wSSIIpl . which 
was obtained by immunoscreening of the expression cDNA library as described above. 
The wSSIIpl probe corresponded to nucleotide positions 988 to 1072 of wSSIIB (SEQ 
ID N0:1) at the hybridisation conditions as described eariier (Rahman et al,, 1998). 

30 

A third cDNA library was constructed from RNA from the endosperm of the hexaploid 



wo 00/66745 

PCT/AUOO/00385 



-56- 



Triticum aestivum cultivar Rosella as described by Rahman et al. (1 997). This library 
was screened with a 347.bp cDNA fragment, wSSIIIpl for the first screening, and a 
478-bp cDNA fragment wSSIIIp3 for the second screening using the hybridisation 
conditions described herein. 



EXAMPLE 5 

Construction and screening of Triticum tauschii genomic library 

The genomic library used in this study, prepared from Triticum tauschii, var 
strangulata. (Accession Number CPI 1 10799). has been described in Rahman et al., 
10 (1997). Of all the accessions of T. tauschii surveyed. DNA marker analysis suggests 
that the genome of CPI 110799 is the most closely related to the D genome of 
hexaploid wheat (Lagudah et ai, 1 991 ). 

Hybridisations were earned out in 25% formamide, 6 x SSC, 0.1 % SDS at 42 °C for 1 6 
15 hours, then filters were washed 3 times using 2 x SSC containing 0.1 % SDS at eS'C 
for 1 hour per wash. 

For the isolation of a genomic wSSII clone, the probe comprised the PCR-derived DNA 
fragment wSSIIp2 and positive-hybridising plaques were digested using the restriction 
20 enzyme BamHI. separated on a 1% agarose gel. transfen-ed to nitrocellulose 
membrane and hybridised to probe wSSIIp4 comprising nucleotides 1 to 367 of the 
wSSIIA cDNA clone, using the conditions described by Rahman et ai. (1 997). 

For the isolation of a genomic wSSIII clone, plaques hybridising to the PCR-derived 
25 DNA fragment wSSIIIpl from clone wSSIII.B3 (i.e. nucleotides 3620 to 3966 of SEQ 
ID NO:7) were selected and re-screened until plaque-purified. 

EXAMPLE 6 
DNA sequencing and analysis 

30 DNA sequencing was performed using the automated ABI system with dye terminators 
as described by the manufacturers. DNA sequences were analysed using the GOG 
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suite of programs (Devereaux et aL, 1984). 

EXAMPLE 7 
DNA and RNA analysis 

5 DNA was isolated and analysed as previously described (Maniatis et ai, 1982; 
Rahman et ai, 1998). Approximately 20 of DNA was digested with restriction 
enzymes BamH\, Dra\ and EcoRI, separated on a 1% agarose gel and transferred to 
reinforced nitrocellulose membranes (BioRad) and hybridised with ^^P-labelled DNA 
probe, either wSSIIIpl , corresponding to nucleotides 3620 to 3966 of the wheat SSIII 
10 gene, or alternatively, with the entire wSSII cDNA clone. DNA fragment probes were 
labelled with the Rapid Multiprime DNA Probe Labelling Kit (Promega). 

The hybridisation and wash conditions were performed as described in Rahman ef a/. 
(1997). For RNA analysis. 10 A^g of total RNA was separated in a 1.4% agarose- 

15 formaldehyde gel and transferred to a Hybond N+ membrane (Amersham), and 
hybridised with cDNA probe at 42°C as previously described by Khandjian et aL, 
(1987) or Rahman et aL, (1998). After washing for 30 minutes at 65°C with 2x SSC. 
0.1% SDS; followed by three washes of 40 minutes at 65°C with 0.2x SSC, 1% SDS, 
the membranes were visualised by overnight exposure at -SO'^C with Kodak MR X-ray 

20 film. 

EXAMPLE 8 

Expression of wheat Sgp-1 polypeptides in the wheat endosperm 

The development and use of monoclonal antibodies to the Sgp-1 proteins has been 
25 described previously (Rahman et aL, 1995). These antibodies were used by the 
present inventors to characterise the expression and localisation of the Sgp-1 proteins. 

The proteins found in the matrix of the wheat starch granule are shown in Figure 1, 
lane 1. The remaining lanes show an immunoblot of proteins from the soluble phase 
30 (Figure 1; lanes 2-4) and the starch granule (Figure 1; lanes 5-7), respectively, 
following SDS-PAGE. In addition to cross-reactivity with the 100-105 kDa proteins, a 
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weak cross-reaction with a 50 kDa protein in both the granule and the soluble fractions 
were observed (Figure 1). The Sgp-1 polypeptides are present in the starch granule 
throughout endosperm development (Figure 1; lanes 5-7. also see Rahman et aL, 
1995). However, as the endosperms matures, there is a reduction in the amount of 
5 Sgp-1 protein found in the soluble fraction. Lane 4 shows that by 25 days after 
anthesis, the level of these proteins in the soluble fraction is substantially reduced. 
This observation is consistent with previous results from Rahman et aL. (1995), who 
suggested that the Sgp-1 proteins were exclusively granule bound based on studies 
of granules from endosperm in mid-late stages endospem development, however, 
10 these results suggest that the partitioning of these proteins between the granule and 
the soluble phase changes during development. 

EXAMPLE 9 

isoiation of cDNA clones encoding wheat starch synthase II (wSSII) proteins 
15 Monoclonal antibodies against Sgp-1 polypeptides (Rahman et aL, 1995) were used 
to probe the expression library described in Example 4 (i.e. the first cDNA library). 
Three immunoreactive plaques were identified and sequenced. One done, designated 
wSSIIpl , contained an 85-bp cDNA insert with homology to maize SSIIa (Ham et aL, 
1 998). 

20 

DNA from the wSSIIpl clone was. used as a probe in the hybridisation screening of the 
second cDNA library, prepared from Triticum aesf/vL/m cultivarWyuna endosperm RNA 
as described in Example 4. Ten hybridising cDNA clones were selected and 
sequenced. On the basis of the DNA sequences obtained, the 10 cDNA clones can be 
25 classified into three groups. Group 1 contains 7 cDNA clones, group 2 contains 2 
cDNA clones and group 3 contains 1 cDNA clone. 

The longest clone from group 1 (designated wSSIIB) is 2939 bp in length (SEQ ID 
NO:1) and encodes a 798 -amino acid polypeptide in the region from nucleotide 
30 position 176 to nucleotide position 2569 (SEQ ID NO:2). 
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The longest clone from group 2 (designated wSSIIA) is 2842 bp in length (SEQ ID 
NO:3) and encodes a 799 -amino acid polypeptide in the region from nucleotide 
position 89 to nucleotide position 2485 (SEQ ID NO:4). 

5 The cDNA from group 3 is a partial cDNA clone (designated wSSIID), which is 2107 
bp in length (SEQ ID NO:5) and encodes a 597 -amino acid polypeptide in the region 
from nucleotide position 1 to nucleotide position 1791 (SEQ ID NO:6). The encoded 
polypeptide is approximately a 200 amino acid residues shorter than that of 
polypeptides encoded by longest clones of group 1 or 2 clones, respectively (Figure 
10 2). 

Comparison of the three cDNA clones, wSSIIB, wSSIIA and wSSIID shows that they 
share 95.7% to 96.6% identity at the amino acid level, with variation at 44 amino acid 
positions between the three sequences (Figure 3). Of the 44 amino acid changes 

15 between these sequences, 31 changes occur in the N-terminal region (residues 1 to 
300). 10 changes occur in the central region (residues 301 to 729) and 3 changes 
occur in the C-terminal region (residues 730 to 799). The wSSIIA polypeptide (799 
amino acid residues) and wSSIIB polypeptide (798 amino acid residues) sequences 
differ in length by a single amino acid residue, due to the deletion of Asp-69 from the 

20 wSSIIB polypeptide sequence. 

A comparison of the nucleotide sequences of the wSSIA, wSSIIB and wSSIID cDNA 
clones with the nucleotide sequence of the wSSllpl cDNA obtained by 
immunoscreening confirms that the wSSIIpl sequence is found in each cDNA (Figure 

25 3). The peptide encoded by the wSSIIpl cDNA clone corresponds to amino acid 
residues in the region from residue 272 to residue 298 of the wSSllA polypeptide, and 
to amino acid residues in the region from residue 271 to residue 297 of the wSSIIB 
polypeptide (see Figure 3). Thus, the peptide epitope encoded by wSSIIpl that reacts 
with the anti-Sgp-1 monoclonal antibodies can therefore be localised to this region of 

30 the wSSIlA and wSSlIB polypeptides and to the corresponding region of the wSSIlD 
polypeptide. 
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Notwithstanding that a region having about 63% amino acid sequence identity to the 
peptide epitope encoded by clone wSSIIpl is found in the maize SSIIa polypeptide 
(Figure 3), the degree of amino acid consen/ation between maize and wheat 
sequences in this region of the polypeptide is insufficient for immunological cross- 
5 reactivity to occur between these species using the monoclonal antibodies to the 
wheat Sgp-1 proteins described by Rahman et al. (1 995). Additionally, this peptide 
epitope is not found in granule-bound starch synthases, SSI, or SSIII (data not shown). 

The wSSIIB cDNA (SEQ ID NO:1) encodes an amino acid sequence comprising the 
10 peptide motif AAGKKDAGID (SEQ ID NO: 18) between residues 60 and 69 of SEQ ID 
NO:2 (Figure 3) which, with the exception of the second residue, is identical to the N- 
terminal of the 100 kDa (AVlGKKDAGID: SEQ ID NOS:19 and 20) protein (Sgp-B1) 
from the wheat starch granule (note that the sequence given in Rahman et al., 1995 
(AVlGKKDAL: SEQ ID NOS: 21 and 22 ) has been revised following further amino acid 
15 sequence analysis). 

The wSSIIA cDNA clone (SEQ ID NO:3) encodes an amino acid sequence comprising^ 
the peptide motif AAGKKDARVDDDAA (SEQ ID NO: 23) at residues 60 to 73 of SEQ 
ID NO:4, which is about 66% identical to the N-terminal amino acid sequence (i.e. 
20 ALGKKDAGIVDGA: SEQ ID NO: 24) of the 104 kDa and 105 kDa starch granule 
proteins, Sgp-D1 and Sgp-AI respectively, as determined by sequence analysis of 
isolated protein (Rahman etaL, 1995). 

Furthermore. Takaoka et al. (1997) reported the amino acid sequences of 3 
25 polypeptides obtained from sequencing starch granule proteins derived from the Sgp-1 
proteins. Peptide 3 described by Takaoka et al. (1997) corresponds to amino acid 
residues 378 to 387 of the amino acid sequence of the wSSIIA cDNA (SEQ ID N0:4; 
Figure 3). Peptides 1 and 2 described by Takaoka et al. (1 997) could not be detected 
in the amino acid sequences of the wSSII cDNA clones of the present invention, 
30 however peptide 1 of Takaoka et al. (1997) can be found in the amino acid sequences 
of SSI from maize, rice, wheat and potato (data not shown). 
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Denyer et al. (1995) demonstrated that the Sgp-1 proteins possess starch synthase 
activity and, as a consequence, the wSSIIB. wSSIA and wSSIID cDNA clones encode 
starch synthase enzynnes that are differentially expressed in a developmentally- 
regulated manner in both the soluble and granule-bound fractions of the endosperm 
5 (Figure 1). Based on the nomenclature suggested by Harn et al. (1998), it is 
appropriate to describe the Sgp-1 proteins as "starch synthases" rather than "granule- 
bound starch synthases". 

EXAMPLE 10 

10 Analysis of wheat starch synthase II mRNA expression 

The mRNA for wheat starch synthase II could be detected in leaves, pre-anthesis 
florets and endosperm of wheat when total RNAs isolated from these tissue were 
probed with a PGR probe, wSSIIp2, corresponding to nucleotide positions 1435 to 
1835 bp of wSSIIB-cDNA (SEQ ID NO:1; Figure 4). Unlike wSSI. which could not be 

15 detected in wheat leaves derived from plants grown under the same conditions, wSSIl 
genes are highly-expressed in the leaves (Figure 4, lane 1), and expressed at an 
intermediate level in pre-anthesis florets (Figure 4, lane 2), and at much lower levels 
in developing wheat endosperm cells (Figure 4, lanes 3-11). In contrast, the maize 
SSIIa is expressed predominantly in the endosperm, whilst the maize SSIIb is detected 

20 mainly in the leaf, albeit at low levels (Ham et aL, 1998). 

The wSSII mRNA was detectable in the endosperm 6 days after anthesis and mRNA 
levels increase between 8 and 18 days post-anthesis, after which time levels of mRNA 
decline. 

25 

Southern blotting experiments in wheat demonstrated that the wSSIIp2 probe used 
detected only a single copy of the SSII gene in each genome (data not shown). Thus, 
it is unlikely that this probe cross-hybridised with mRNAs encoded by genes other than 
wSSII. 



30 
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EXAMPLE 11 

Chromosomal localization of the wheat wSSlI genes. 

I. Amplification of specific c DNA regions of wheat starch synthase II using PCR 
Two PCR products, wSSIIp2 and wSSIIp3 were amplified from the cDNA clone wSSIIB 
5 and used for the northern hybridisation and Southem hybridisation, respectively. 

The primers sslla (5* TGTTGAGGTTCCATGGCACGTTC 3': SEQ ID NO: 25) and ssllb 
(5* AGTCGTTCTGCCGTATGATGTCG 3': SEQ ID NO: 26) were used to amplify the 
cDNA fragment wSSIIp2 (i.e. nucleotide positions 1435 to 1835 of SEQ ID NO:1). 

10 

The primers ssllc (5' CCAAGTACCAGTGGTGAACGC 3': SEQ ID NO: 27) and sslld 
(5* CGGTGGGATCCAACGGCCC 3': SEQ ID NO: 28) were used to amplify the cDNA 
fragment wSSIIp3 (i.e. nucleotide positions 2556 to 2921 of SEQ ID NO:1). 

15 The amplification reactions were performed using a FTS-1 thermal sequencer (Corbett, 
Australia) for 1 cycle of 95X for 2 minutes; 35 cycles of 95°C for 30 seconds, 60°C for 
1 minutes, 72°C for 2 minutes and 1 cycle of 25°C for 1 minute. 

II. PCR and nucleotide se quence analysis of 3' sequences of wheat SSII oenes 
20 Genomic DNA was extracted from wild-type Chinese Spring wheat, and from three 
nullisomic-tetrasomic lines of chromosome 7 of Chinese Spring wheat, and from 
Triticum tauschii (var strangulata, accession number CPI 100799), and used as a 
template for the amplification and nucleotide sequence analysis of wheat SSII genes. 

25 RFLP analysis of SamHI and EcoRI restricted DNA from each wheat or T. Tauschii line 
was carried out using the wSSIIp3 fragment as a probe. Three hybridising bands were 
obtained which could be assigned to chromosomes 7A, 7B and 7D, respectively (data 
not shown). This analysis indicates that there is a single copy of the wSSIl gene in 
each genome in hexaploid wheat, consistent with the findings of Yamamori and Endo 

30 (1996) who located the SGP-A1. B1 and D1 proteins to the short arm of chromosome 
7. 
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PCR analysis was used to assign each of the cDNA clones to the individual wheat 
genomes. A single 365 bp PGR fragment was obtained from nullisomic-tetrasomic 
genomic DNA of Chinese Spring when primers ssllc and sslld were used for the PGR 
amplification (Figure 5. right panel). This PGR product is obtained only from lines 
5 bearing the B genome. The fragment was cloned and sequenced and shown to be 
identical to a 365 bp region of the wSSIIB cDNA. An identical fragment is obtained by 
PGR amplification of the wSSIIB cDNA clone, but not by amplification of the wSSIIA 
or wSSIID clones, supporting the conclusion that the wSSIIB cDNA is the product of 
a gene located on chromosome 7 of the B genome of hexaploid wheat. 

10 

Two PGR products were also amplified from nullisomic-tetrasomic genomic DNA of 
Ghinese Spring using the primers ssllc and sslle (Figure 5. left panel). One PGR 
fragment, approximately 350 bp is only amplified when the A genome is present, and 
a second 322 bp product Is only amplified when the D-genome is present. The 350 and 
15 322 bp PGR products were also cloned and sequenced and shown to be Identical to 
the wSSIlA and wSSIID cDNAs, respectively, supporting the conclusion that the 
wSSIIA and wSSIID cDNAs are the products of genes located on chromosomes 7A 
and 7D, respectively. 

20 EXAMPLE 12 

Isolation of genomic wSSII clones 

Screening of a genomic library from the D-genome donor of wheat, T. tauschii, was 
performed as described in Example 5, using the PGR-derived DNA fragment wSSIIp2 
as a hybridisation probe. A positive-hybridising clone, designated wSSII-8, and 
25 comprising a putative T. tauschii homologue of the wSSII gene, was isolated. 

Positive-hybridising plaques were digested using the restriction enzyme SamHI, 
separated on a 1% agarose gel, transferred to nitrocellulose membrane and hybridised 
to probe wSSIIp4 comprising nucleotides 1 to 367 of the wSSIIA cDNA clone, using 
30 the conditions described by Rahman at. (1997). Clone wSSIi-8 also hybridises 
strongly to the wSSIIp4 probe, confirming its identity as a genomic wSSII gene. 
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The complete nucleotide sequence of the wSSII gene was determined and is 
presented herein as SEQ ID NO: 37. The structural features of this gene are present 
in Table 3. A schematic representation of the intron/exon organisation of this gene is 
also presented in Figure 6. 



TABLE 3 

Structural features of the wheat starch synth 
Nucteotlde Position Feature 



ase II genomic gene 



Length (bases) 



in SEQ ID NO: 37 



10 1- 1416 



1417-1743 



1480-1482 
1 744 - 1 847 

1848-2553 
15 I 2554-2641 
2642 - 2706 
2707 - 3606 



5'-untranscribed region and 
promoter sequence 
exon 1 



1416 



327 



translat ion start codon (ATG) | 3 
intron 1 I 104 



exon 2 
intron 2 
exon 3 
intron 3 



706 
88 
65 
900 



3607 - 3684 



exon 4 



78 



3685 - 3773 



intron 4 



89 



20 3774 - 3884 



3885 - 3981 



exon 5 



111 



intron 5 



97 



3982 - 4026 



exon 6 



45 



4027 - 4406 



intron 6 



380 



4407 - 4580 



exon 7 



174 



25 4581 - 7296 



7297 - 8547 



intron 7 



2716 



exon 8 



1251 



8251 -8253 



translation stop codon (TGA) 3 
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EXAMPLE 13 

Cloning of specific cDNA regions of wheat starch synthase III using RT-PCR 

PGR primers were used to amplily sequences of starch synthase III from wheat 
endospenn cDNA. The design of PGR primers was based on the sequences of starch 
5 synthase III from potato and the dul starch synthase Mi gene of maize. 

First-strand cDNAs were synthesised from 1 ^ug of total RNA (derived from endosperm 
of the cultivar Rosella. 12 days after anthesis) as described by Maniatis et al. (1 982), 
and then used as templates to amplify two specific cDNA regions, wSSIIIpl and 
10 wSSIIIp2, of wheat starch synthase III by PGR. 

The primers used to obtain the cDNA clone wSSIIIpl were as follows: 
Primer wSS3pa (5* GGAGGTCTTGGTGATGTTGT 3": SEQ ID NO: 29); and 
Primer wSS3pb (5* CTTGACCAATCATGGCAATG 3': SEQ ID NO: 30). 

15 

The primers used to obtain the cDNA clone wSSIIIp2 were as follows: 
Primer wSS3pc (5' CATTGCCATGATTGGTCAAG 3': SEQ ID NO: 31); and 
Primer wSS3pd (5' ACCACCTGTCCGTTCGGTTGC 3': SEQ ID NO: 32). 

20 The amplified clones wSSIIIpl and wSSIIIp2 were used as probes to screen the third 
cDNA library and 7. tauschii genomic DNA library as described in Example 4. 

A further probe designated wSSIIIp3 was used for screening the third cDNA library, as 
described in Example 4. Probe wSSIIIp3 was amplified by PGR from a cDNA clone 
25 produced from the first screening using the following amplification primers: 

Primer wSS3pe (5' GCACGGTCTATGAGAACAATGGC 3': SEQ ID NO; 33); and 
Primer wSS3pf (5' TCTGCATACCACCAATCGCCG 3': SEQ ID NO; 34). 

The amplification reactions were performed using a FTS-1 or FTS4000 thermal 
30 sequencer (Corbett. Australia) for 1 cycle of 95°C for 2 minutes; 35 cycles of 95°C for 
30 seconds, 60°C for 1 minutes, 72°C for 2 minutes and 1 cycle of 25°C for 1 minute. 
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Amplified sequences of the expected length were obtained, cloned and sequenced, 
and shown to contain DNA sequences highly homologous to the maize and potato 

• 

SSIII genes. PGR fragments were subsequently used to probe a wheat cDNA library 
5 by DNA hybridisation and 8 positive clones were obtained, including one 3 kb cDNA. 
A region from the 5' end of this cDNA was amplified by PGR and used a probe for a 
second round of screening the cDNA library, obtaining 8 cDNA clones. Of these, one 
cDNA was demonstrated to be full length (wSSIII.B3, 5.36 kb insert). The sequence 
of the 5.346 bp WSSIII.B3 cDNA clone is given in SEQ ID NO;7. 

10 

Sequencing of the 8 cDNA clones obtained from the second round screening of the 
wheat cDNA library revealed that there were at least 2 classes of cDNA encoding SSIII 
present, possibly being encoded by homeologous genes on different wheat genomes. 
The sequence of a representative of this second class of cDNA clones, wSSIII.BI , is 
15 shown in SEQ ID N0:9. The 3261 bp clone wSSIII.BI is not full length, however it is 
similar to nucleotides 1739 to 5346 of the homeologous clone wSSIII.B3 (SEQ ID NO: 
7). Clone wSSIII.BI has an open reading frame between nucleotide positions 1 and 
3177. 

20 An open reading frame is found in the cDNA clone wSSIII.B3 (SEQ ID NO:7). in the 
region between position 29, commencing the ATG start codon, and nucleotide position 
4912. The amino acid sequence deduced from this open reading frame is shown in 
SEQ ID NO:8. 

25 An alignment of the deduced amino acid sequences of SSIII from maize, potato and 
wheat is shown in Figure 7. There is about 56.6% identity between the maize SSIII and 
wheat WSSIII.B3 sequence at the amino acid level. 

The C-terminal domain of starch synthases comprise the catalytic domain, and a 
30 characteristic amino acid sequence motif KVGGLGDWTSLSRAVQDLGHNVEV (SEQ 
ID NO: 35) in maize, or alternatively KVGGLGDWTSLSRAIQDLGHTVEV (SEQ ID 
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NO; 36) in wheat, marking the first cpnsen/ed region in the C-terminal domain. This 
amino acid sequence is present at amino acid residues 1 194 to 1218 of SEQ ID NO: 
8. 



5 The amino acid identity between maize dulll and wSSIII.B3 in the N-terminal region 
(i.e. amino acids 1 to 600 in Figure 7) is only 32.2%; whilst the amino acid identity in 
the central region (i.e. amino acids 601 to 1248 in Figure 7) is 68.4%; and in the C- 
terminal region (i.e. amino acids 1249 to 1631 In Figure 7) Is 84.6%. Accordingly, the 
SSI II starch synthases are much more highly conserved between maize and wheat in 
10 the region comprising the catalytic domain of the proteins. 

EXAMPLE 14 

Analysis of wheat starch synthase III mRNA expression 

Figure 8 shows the expression of wSSIII mRNA during endosperm development in two 
15 wheat varieties grown under defined environmental conditions. The expression of the 
gene is seen very eariy in endosperm development in both cultivars, 4 days after 
anthesis (Figure 8, panels a and b). Expression in the leaf of the variety Gabo is very 
weak (Figure 8, panel c, Lane L) whereas strong expression is seen in pre-anthesis 
florets (Figure 8, panel c. Lane P). 

20 

EXAMPLE 15 
Amino acid sequence comparisons between 
wheat SSII and SSIIi polypeptides 

Amino acid sequence comparisons between wheat BSSS, SSI, SSII and SSIII 
25 polypeptides reveals eight highly-conserved domains (Figure 9). The amino acid 
sequences of these domains are represented in the wheat SSIII amino acid sequence 
by the following sequence motifs: 

(a) Region 1 : KVGGLGDWTS; 

(b) Region 2: GHTVEVILPKY; 

30 (c) Region 3: HDWSSAPVAWLYKEHY; 

(d) Region 4: GILNGIDPDIWDPYTD; 
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(e) Region 5: DVPIVGIITRLTAQKG; 

(f) Region 5a: NGQWLLGSA; 

(g) Region 6: AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 

(h) Region 7; TGGLVDTV. 

5 

These consen/ed amino acid sequences are summarised in Table 4. As shown in 
Table 4 below, there is at least about 25% amino acid sequence identity, preferably 
at least about 30% amino acid sequence identity, more preferably at least about 35% 
amino acid sequence identity, more preferably at least about 40% amino acid 

10 sequence identity, more preferably at least about 45% amino acid sequence identity, 
more preferably at least about 50% amino acid sequence identity, more preferably at 
least about 55% amino acid sequence identity, more preferably at least about 60% 
amino acid sequence identity, more preferably at least about 65% amino acid 
sequence identity, more preferably at least about 70% amino acid sequence identity, 

15 more preferably at least about 75% amino acid sequence identity, more preferably at 
least about 80% amino acid sequence identity, more preferably at least about 85% 
amino acid sequence identity, nnipre preferably at least about 90% amino acid 
sequence identity and even more preferably at least about 95% amino acid sequence 
identity between the amino acid sequences of plant starch synthase enzymes, in 

20 particular wheat starch synthases. 

From the data presented in Table 4, the most conserved regions of the wheat SSII 
and SSIII polypeptides are a region of 6 or 7 identical amino acids in Region 1 and a 
region of 8 or 9 identical amino acids in Region 6. The lowest regions of identity are 
25 found in regions 3 and 5a. 

For each of the amino acid sequences presented in the first column of Table 4, which 
are specific for wSS I II polypeptides, corresponding signature motifs which are specific 
for wSSII-A. wSSII-B, and wSSII-D polypeptides can be derived from the alignment. 
30 as follows: 

Region 1: KTGGLGDVAGA; 
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Region 2: 


GHRVI\4WVPRY; 


Region 3: 


N DWHTALLPVYLKA Y Y; 


Region 4: 


GIVNGIDNMEWNPEVD; 


Region 5: 


DVPLLGFIGRLDGQKG- 


Region 5a: 


DVQLVMLGTG; 


Region 6: 


AGADALLMPSRF(EA/)PCGLNQLYAI\^AYGT; and 


Region 7: 


VGG(V/L)RDTV. 



Comparison of the amino acid sequences of all available starch synthases with the 
10 deduced amino acid sequences of the three wSSII cDNA clones of the present 
invention (i.e. wSSIIB. wSSIIA and wSSIID) was conducted using PILEUP analysis 
(Devereaux etal., 1984) and data are presented herein as a dendrogram (Figure 10). 
The sequence of the glycogen synthase of E. coli was also included. Based upon their 
amino acid similarities, four classes of plant starch synthases can be defined: GBSS. 
15 SSI. SSII andSSIII. 

Table 5 shows that levels of identity at the amino. acid level between the wSSII 
sequences, as detemnined using the BESTFIT programme in GCG (Devereaux et af., 
1984). and other class II starch synthases range from 70% identity with potato SSII to 
20 85% identity with maize SSIIa. Both wSSIIB and wSSIID showed significantly higher 
homology to maize SSIIa than wSSI|A. Based upon sequence identities and the 
function of the Sgp-1 proteins in wheat, the wSSIIB. wSSIIA and wSSID cDNA clones 
are members of the starch synthase II (SSII) group and are more similar in sequence 
to maize SSIIa than maize SSIIb. 

25 
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TABLE 4 

Identities between conserved motifs of plant starch synthases 



Sequence in wSSIIi 
polypeptide 



5 I Region 1 : 

KVGGLGD WTS 
Region 2: 
GHTVEVILPKY 
Region 3: 
10 I HDWSSA PVAWLYKEHY 
Region 4: 

GILNGIDPDIWDPYTD 
Region 5: 

DVPIVGIIT RLTAQKG 
15 I Region 5a; 

NGQVVLL GSA 
Region 6; 

AGSDFIIVPSIFEPCGLT 
QLVAMRYGS 



Number of conserved Number of conserved 



residues between wheat 
starch synthases 



6/11 residues 



6/11 residues 



4/16 residues 



7/16 residues 



8/16 residues 



4/10 residues 



15/27 residues 



20 I Region 7: 
TGGLVDTV 



5/9 residues 



residues between 
wheat SSII and SSiii 
polypeptides 



6/11 residues 



6/11 residues 



5/16 residues 



8/16 residues 



10/16 residues 



4/10 residues 



17/27 residues 



5/9 residues 
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TABLE 5 





Wooll-A 


WoSII-D 


wSSli-D 


WdOl-A 


1U0% 






WOOII-D 


yo.y 70 


1 00% 




wSSII-D 


96.3% 


96.7% 


100% 


maize SSIla 


76.1% 


85.2% 


84.7% 


maize SSilb 


76.3% 


76.7% 


75.9% 


pea SSII 


72.0% 


72.2% 


71.8% 


potato SSII 


70.9% 


71.1% 


70.3% 



10 

Figure 1 1 shows a schematic representation of an alignment of plant starch synthase 
sequences, including wheat GBSS, wheat SSI. wheat SSII-A1. maize SSIla. and 
maize dull-1 polypeptides, in which the position of the first homologous region, 
comprising the consensus motif KXGG, is used as the basis of the alignment. The 

15 major differences in structure between the classes of genes are found in the length of 
the N-terminal region between the transit peptide and the first conser/ed region. At 
one extreme, the GBSS genes have a very short N-terminal arm, whereas the du1 
starch synthase contains a very long N-terminal extension containing several distinct 
regions. The wSSII genes contain an N-terminal extension which is longer than either 

20 GBSS, SSI, or SSIIb, and slightly longer than the maize SSIla gene. 

EXAMPLE 16 
Isolation of genomic clones for SSIII 
Screening of a genomic library from the D-genome donor of wheat,* 7. tauschii, 
25 identified a number of clones which hybridised to the wSSIII PGR fragment. Positive 
plaques in the genomic library were selected as those hybridising with a probe that had 
been generated by PGR (amplifying between nucleotide positions 3620 to 3966) from 
the SSIII cDNA as template. The primer sequences used were as follows: 
wSS3pa (5' GGAGGTCTTGGTGATGTTGT 3': SEQ ID NO: 29); and 
30 wSS3pb (5* CTTGACCAATCATGGCAATG 3* : SEQ ID NO: 30). 
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Hybridisation was carried out in 25% formamide. 6 x SSC. 0.1% SDS at 42 for 16 
hour, tiien washed three times with 2 x SSC containing 0.1 % SDS at 65 "C, for 1 hour 
per wash, shows an example of a plaque lift showing positive and negative 
hybridisations for plaques containing the T. tauschii homologue of the wSSIII.B3 gene. 



5 



DNA was isolated from positive-hybridising A clones using methods described by 
Maniatis et al. Briefly. DNA was digested using BamH\ or Bg/I and sub-cloned in to the 
vector pJKKmfm. DNA sequencing was perfomied using the automated ABI system 
with dye terminators as described by the manufacturers. DNA sequences were 
10 analysed using the GCG suite of programs (Devereaux et al., 1984). 

Nucleotide sequences of the genomic SSI 1 1 clone from T. tauschii are provided herein 
as 6 contiguous sequences designated fragments 1 to 6 (SEQ ID NOs: 11 to 16, 
respectively). Table 6 defines the relative positions of these fragments with respect to 
15 the SSI 1 1 cDNA and describes the positions of exons. Figure 1 1 shows this information 
schematically. 

The complete nucleotide sequence of a wheat SSHI genomic gene is presented herein 
as SEQ ID NO: 38. The structural features of this gene are presented in Table 7. A 
20 schematic representation of the intron/exon organisation of this gene is also presented 
in Figure 12. 



EXAMPLE 17 
Discussion 

25 Early work on the Sgp-1 starch synthase proteins (Denyer et a!., 1995; Rahman et al., 
1 995) was based on the localisation of these proteins in the wheat starch granule, and 
no definitive conclusion conceming their presence or absence in soluble extracts of the 
wheat endosperm was presented. 

30 We have now demonstrated that a monoclonal antibody against the Sgp-1 proteins 
cross reacts strongly with those starch synthase proteins having apparent molecular 
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weights of 100-105 kDa in soluble extracts, however, the appearance of these proteins 
in soluble extracts is dependant on the developmental stage of the endosperm 
material. Whilst the proteins can be detected in the soluble phase in early to mid 
endosperm development, little or no soluble protein remains in late endosperm 
5 development (Figure 1). This observation accounts for the failure of Rahman et al. 
(1995) to detect the protein in soluble extracts in a previous report. 

Based upon the localisation of the Sgp-1 starch synthase proteins in the wheat 
endospenn. the following nomenclature is suggested for wheat starch synthase 
10 enzymes: wGBSS for the 60 kDa granule bound starch synthase (Wx); wSSi for the 
75 kDa starch synthase I (Sgp-3): wSSII for the 100 - 105 kDa proteins (Sgp-1); and 
wSSIII for a soluble high molecular starch synthase. 

The present invention provides cDNA and genomic clones encoding the wSSII and 

15 wSSIII polypeptides and the corresponding genomic clones. Whilst the evidence is 
compelling that the wSSIIA, wSSIIB and wSSIID cDNAs encode the Sgp-AI. Sgp-BI 
and Sgp-DI proteins of the wheat starch granule, molecular weights calculated from 
the deduced amino acid sequences of the clones are considerably lower than 
estimates obtained from SDS-PAGE. The molecular weight of the precursor wSSIIA 

20 protein is 87.229 Da, and the mature protein 81.164 Da. yet the estimated molecular 
weight in our experience is 105 kDa. The assignment of the wSSIIA cDNA to the A- 
genome of wheat is demonstrated in Figure 5, and the assignment of the 105 kDa 
protein to the A-genome in Denyer et al. (1995) and Yamamori and Endo (1996). 
Similarly, the molecular weight of the wSSliB protein is 86,790 Da and the mature 

25 protein 80,759 Da, yet the molecular weight of the Sgp-BI protein is estimated to be 
100 kDa. No comparison can be made of the wSSIID sequences as a full length cDNA 
clone was not obtained. The wSSIIA and wSSIIB amino acid sequences differ by just 
a single amino acid residue, yet there is an apparent difference of 5 kDa in molecular 
weight when estimated by SDS-PAGE. Several possibilities can be advanced to 

30 account for this apparent discrepancy in molecular weights. Firstly, the wSSII proteins 
may not migrate in SDS-PAGE in accordance with their molecular weight because they 
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retain some conformation under the denaturing conditions used. Secondly, the proteins 
may be glycosylated. It is also possible that the proteins may be non-covalently linked 
to starch through a high affinity starch binding site which survives denaturation and 
SDS-PAGE. Differences between the apparent molecular weights and those calculated 
5 from the deduced amino acid sequences will have to be defined in establishing the 
relationship between the wSSII proteins and proteins encoded by the analogous SSI I 
genes of other species. 

The catalytic domain of the starch synthases is found at the C-terminal end of the 
10 protein (Gao etaL, 1998; Ham etaL, 1998). Harn etal. (1998) identified 7 conserved 

regions among SSIIa, SSIIb. SSI and GBSS sequences. We have identified an 

additional consen/ed region (designated region 5a in Table 4 and Figure 10) 

comprising the amino acid sequence motif DVQLVMLGTG, by a comparison of the . 

wSSII and wSSIII sequences of the present Invention with differing isoforms of other 
1 5 plant starch synthases (GBSS, SSI , SSII and SSIII). The conservation of eight peptide 

regions among the 4 classes of starch synthases is striking, in terms of their sequence 

homologies and their alignment. 

Analysis of the wheat SSII genes shows that there is a motif, PVNGENK, which is 
20 repeated. The area surrounding the repeated PVNGENK motif is not homologous to 
maize SSIla and the insertion of this region is responsible for the difference in length 
between the wheat SSII and maize SSIla genes. In pea and potato SSII polypeptides, 
a RPR motif (Figure 3; residues 251-253 and 287-289 respectively) has been 
suggested to mark the end of the N-terminal region and to facilitate the flexibility of an 
25 "N-terminal arm". This motif is not found in either the maize or wheat SSII sequences. 

The generation of a wheat line combining null alleles at each of the three wSSII loci, 
wSSIIA. wSSlIB and wSSIID, has been reported recently by Yamamori (1998). in this 
triple null line, the large starch granules were reported to be mostly deformed and a 
30 novel starch with high blue value was observed when stained with iodine, indicating 
that wSSlI is a key enzyme for the synthesis of starch in wheat. Further analysis of the 
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starch derived from this triple null mutant is in progress. 

Mutations in starch synthases are known in three other species. In pea, mutation in 
SSI I gives rise to starch with altered granule morphology and an amylopectin which 
5 yields an oligosaccharide distribution with reduced chain length on debranching, 
compared to the wild type (Craig et aL, 1998). A similar mutation in a gene designated 
SSI! is known in Chlamydomonas (the sta-3 mutation) and similar effects on granule 
morphology and amylopectin structure are observed (Fontaine et ai, 1993). In maize, 
two mutations affecting starch synthases are known. First, the duU1 mutation has been 

10 shown to be caused by a lesion within the du^ SS Ill-type starch synthase gene (Gao 
et a/., 1998). A second mutation, the sugary-2 mutation yields a starch with reduced 
amylopectin chain lengths on debranching (this mutation co-segregates with the SSI la 
locus (Harn et aL, 1998) although direct evidence that the sugary-l mutation is caused 
by a lesion in the SSIIa gene is lacking). In the SSII mutants of each of these species, 

15 amylose biosynthesis capacity is retained, suggesting different roles in amylose and 
amylopectin synthesis for the GBSS and SSII genes. Given the consen/ation in overall 
organisation of the GBSS and SSII genes (see Figures 12 and 13), when an alignment 
is made based on the KTGGL motif of the first conserved region, this focuses attention 
on the role(s) of the N-terminal region in defining substrate specificity and the 

20 localisation of the proteins as the N-terminal region is the major area of divergence 
between the 4 classes of starch synthases. However, it is premature to exclude the 
influence of more subtle mutations in central and C-terminal regions of the gene. 

The cloning of the wSSII and wSSIII cDNAs and genomic clones described herein 
25 provides useful tools for the further study of the roles of the starch synthases in wheat. 
Firstly, they provide a source of markers which can be used to recover and combine 
null or divergent alleles. Secondly, genetic manipulation of wheat by gene suppression 
or over-expression can be carried out, and the genes may be used for over expression 
in other species. The promoter regions of these genes are also useful in regulating the 
30 expression of starch synthase genes and other heterologous genes in the developing 
wheat endosperm and in pre-anthesis florets of wheat. 
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TABLE7 

Structural features of the wheat starch synthase 111 genomic gene 





Nucleotide Position 


j Feature 


Length (bases) 




1 in SEQ ID NO: 38 






5 


1 1-973 


5'-untranscribed region and 
1 promoter sequence 


1 973 




1 974 - 1 099 


exon 1 


1 126 




1 1001-1003 


1 translation start codon (ATG) 


1 ^ 




1 1100-2056 


intron 1 


1 957 




1 2057-2120 


I exon 2 


1 64 


10 


2121 -2588 


intron 2 


468 




2589 - 5291 


exon 3 


2703 




5292 - 5549 


intron 3 | 


258 




5550 - 5767 | 


exon 4 | 


218 




5768-6103 1 


intron 4 


336 


15 


6104-6374 1 


exon 5 . | 


271 




6375-7148 | 


intron 5 | 


774 




7149-7324 | 


exon 6 


176 




7325 - 7438 | 


intron 6 


114 




7439 - 7546 | 


exon 7 j 


108 


20 


7547 . 7792 | 


intron 7 


246 




7793 - 7902 


axon 8 


110 




7903 - 8797 


intron 8 


895 




8798 - 8900 


exon 9 


103 




8901 - 9164 


intron 9 


264 


25 


9165 - 9335 


exon 10 


171 




9336 - 9460 


intron 10 


125 




9461 - 9589 


exon 1 1 


129 




9590 - 9677 


intron 1 1 


88 
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CLAIMS: 

1 . An isolated nucleic acid molecule which comprises a sequence of nucleotides selected 
from the group consisting of: 

(i) the nucleotide sequence set forth in SEQ ID NO; 1 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(ii) the nucleotide sequence set forth in SEQ ID NO: 3 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(iii) the nucleotide sequence set forth in SEQ ID NO: 5 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(iv) the nucleotide sequence set forth in SEQ ID NO: 7 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(v) the nucleotide sequence set forth in SEQ ID NO: 9 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(vi) the nucleotide sequence set forth in SEQ ID NO: 1 1 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(vii) the nucleotide sequence set forth in SEQ ID NO: 12 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(viii) the nucleotide sequence set forth in SEQ ID NO: 13 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(ix) the nucleotide sequence set forth in SEQ ID NO: 14 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(x) the nucleotide sequence set forth in SEQ ID NO: 15 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(xi) the nucleotide sequence set forth In SEQ ID NO: 16 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(xii) the nucleotide sequence set forth in SEQ ID NO: 37 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(xiii) the nucleotide sequence set forth in SEQ ID NO: 38 or the protein-encoding 
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region thereof or a degenerate nucleotide sequence thereto; 

(xiv) the nucleotide sequence set forth in SEQ ID NO: 1 1 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(XV) a nucleotide sequence which encodes a wheat starch synthase polypeptide 
hereinbefore defined wherein said nucleotide sequence has at least about 85% identity 
overall to any one of (i) to (xiv); and 

(xvi) a nucleotide sequence which is complementary to any one of (i) to (xv). 



as 



The isolated nucleic acid molecule according to claim 1 wherein the wheat starch 
synthase polypeptide further comprises one or more amino acid sequences selected 
from the group consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GiLNGIDPDIWDPVTD; 

(e) DVPIVGIITRLTAQKG; 
(0 NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; 

(h) TGGLVDTV: 

(i) KTGGLGDVAGA; 
0) GHRVMVWPRY; 

(k) NDWHTALLPVYLKAYY; 
(I) GIVNGIDNMEWNPEVD; 
(m) DVPLLGFIGRLDGQKG; 
(n) DVQLVMLGTG; 
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(o)AGADALLMPSRF(EA/)PCGLNQLYAMAYGT; and 
{p)VGG{V/L)RDW. 

3. The isolated nucleic acid molecule according to claim 2 wherein the wheat starch 
synthase polypeptide comprises at least three of said amino acid sequences selected 
from the group consisting of (a) to (h). 

4. The isolated nucleic acid molecule according to claim 2 wherein the wheat starch 
synthase polypeptide comprises at least six of said amino acid sequences selected 
from the group consisting of (i) to (p). 

5. The isolated nucleic acid molecule according to claim 1 encoding a wheat starch 
synthase II polypeptide. 

6. The isolated nucleic acid molecule according to claim 1 encoding a wheat starch 
synthase III polypeptide. 

7. An isolated nucleic acid molecule encoding a starch synthase polypeptide which 
comprises one or more amino acid sequences selected from the group consisting of: 

(a) GHTVEVILPKY; 

(b) HDWSSAPVAWLYKEHY; 

(c) DVPIVGIITRLTAQKG; 

(d) NGQWLLGSA; 

{e)AGSDFllVPSIFEPCGLTQLVAMRYGS; 
(OTGGLVDTV; 

(g) GIVNGIDNMEWNPEVD; and 
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(h) AGADALLMPSRF(E/V)PCGLNQLYAMAYGT. 



8. The isolated nucleic acid molecule of claim 5 encoding a wheat starch synthase It 
polypeptide which comprises an amino acid sequence selected from the group 
consisting of: 



(i) SEQ ID NO: 2; 

(ii) SEQ ID NO: 4; 



(iii) SEQ ID NO: 6; and 



(iv) a homologue of any one of (i) to (iii) having at least about 85% identity thereto. 



9. The isolated nucleic acid molecule of claim 6 encoding a wheat starch synthase III 
polypeptide which comprises an amino acid sequence selected from the group 
consisting of: 

r 

(i) SEQ ID NO: 8; 

(ii) SEQ ID NO: 10; and 

(iii) a homologue of (i) or (ii) having at least about 85% identity thereto. 



10. A probe or primer comprising at least about 1 5 contiguous nucleotides in length derived 
from the nucleotide sequence according to claim 1 . 



1 1 . The probe or primer according to claim 1 0 comprising a nucleotide sequence selected 
from the group consisting of: 

(i) the nucleotide sequence set forth in SEQ ID NO: 25; 

(ii) the nucleotide sequence set forth in SEQ ID NO: 26; 

(iii) the nucleotide sequence set forth in SEQ ID NO: 27; 
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(iv) the nucleotide sequence set forth in SEQ ID NO: 28; 

(v) the nucleotide sequence set forth in SEQ ID NO: 29; 

(vi) the nucleotide sequence set forth in SEQ ID NO: 30; 

(vii) the nucleotide sequence set forth in SEQ ID NO: 31 ; 

(viii) the nucleotide sequence set forth in SEQ ID NO: 32; 
(fx) the nucleotide sequence set forth in SEQ ID NO: 33; 

(x) the nucleotide sequence set forth in SEQ ID NO: 34; 

(xi) a nucleotide sequence which encodes an amino acid sequence selected from 
the group consisting of: 

(a) KVGGLGDWTS: 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; 

(h) TGGLVDTV; 

(i) KTGGLGDVAGA; 
(j) GHRVMVWPRY; 

(k) NDWHTALLPVYLKAYY; 
(I) GIVNGIDNMEWNPEVD; 
(m) DVPLLGFIGRLDGQKG; 
(n) DVQLVMLGTG; 

(o)AGADALLMPSRF(E/V)PCGLNQLYAMAYGT; and 
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(p)VGG(V/L)RDTV; 

(xii) a nucleotide sequence comprising at least about 15 contiguous nucleotides of 
an intron region of SEQ ID NO: 37; 

(xiii) a nucleotide sequence comprising at least about 15 contiguous nucleotides of 
an intron region of SEQ ID NO: 38; and 

(xiv) a nucleotide sequence which is complementary to any one of (i) to (xiii). 

12. An isolated or recombinant polypeptide, protein or enzyme comprising an amino acid 
sequence selected firom the following: 

(i) the amino acid sequence set forth in SEQ ID NO: 2 or the mature protein region 
thereof; 

(ii) the amino acid sequence set forth in SEQ ID NO: 4 or the mature protein region 
thereof; 

(iii) the amino acid sequence set forth in SEQ ID NO: 6 or the mature protein region 
thereof; 

(iv) the amino acid sequence set forth in SEQ ID NO: 8 or the mature protein region 
thereof; 

(v) the amino acid sequence set forth in SEQ ID NO: 10 or the mature protein 
region thereof; 

(vi) a wheat starch synthase polypeptide having at least about 85% identity overall 
to any one of (i) to (v). 

1 3. The isolated or recombinant polypeptide according to claim 12 further comprising one 
or more amino acid sequences selected from the group consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY: 
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(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 
(0 NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; 

(h) TGGLVDTV; 

(i) KTGGLGDVAGA; 
G) GHRVMWVPRY; 

(k) NDWHTALLPVYLKAYY; 
(I) GIVNGIDNMEWNPEVD; 
(m) DVPLLGFIGRLDGQKG; 
(n) DVQLVMLGTG; 

(o)AGADALLMPSRF(EA/)PCGLNQLYAMAYGT; and 
(p)VGG(V/L)RDTV. 

1 4. The isolated or recombinant polypeptide according to claim 1 3 wherein the wheat starch 
synthase polypeptide comprises at least three of said amino acid sequences selected 
from the group consisting of (a) to (h). 

1 5. The isolated or recombinant polypeptide according to claim 1 3 wherein the wheat starch 
synthase polypeptide comprises at least six of said amino acid sequences selected 
from the group consisting of (i) to (p). 

1 6. The isolated or recombinant polypeptide according to claim 1 2 encoding a wheat starch 
synthase II polypeptide. 
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1 7. The isolated or recombinant polypeptide according to claim 1 2 encoding a wheat starch 
synthase III polypeptide. 

1 8. An isolated or recombinant starch synthase polypeptide which comprises one or more 
amino add sequences selected from the group consisting of: 

(a) GHTVEVILPKY; 

(b) HDWSSAPVAWLYKEHY; 

(c) DVPIVGIITRLTAQKG; 

(d) NGQWLLGSA; 

(e) AGSDFIIVPSIFEPCGLTQLVAMRYGS; 

(f) TGGLVDTV; 

(g) GIVNGIDNMEWNPEVD; and 

(h) AGADALLMPSRF{EA/)PCGLNQLYAMAYGT. 

19. The isolated or recombinant polypeptide according to claim 16 consisting of a wheat 
starch synthase II polypeptide which comprises an amino acid sequence selected from 
the group consisting of: 

(i) SEQ ID NO: 2; 

(ii) SEQ ID NO: 4; 

(iii) SEQ ID NO: 6; and 

(iv) a homoiogue of any one of (i) to (iii) having at least about 85% identity thereto. 



20. 



The isolated or recombinant polypeptide according to claim 17 consisting of a wheat 
starch synthase III polypeptide which comprises an amino acid sequence selected from 
the group consisting of: 
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(i) SEQ ID NO: 8; 

(ii) SEQ ID NO: 10; and 

(iii) a homologue of (i) or (ii) having at least about 85% identity thereto. 

21. The isolated or recombinant polypeptide according to claim 12 substantially free of 
conspecific or non-specific proteins. 

22. A method comprising: 

(i) hybridising single-stranded or double-stranded mRNA, cDNA or genomic DNA 
with a nucleotide sequence selected from the group consisting of: 

(a) the nucleotide sequence according to any one of claims 1 to 9; 

(b) a probe or primer derived from a nucleotide sequence according to sub- 
paragraph (a) and comprising at least about 15 contiguous nucleotides of said 
nucleotide sequence in length; and 

(ii) detecting the hybridised mRNA, cDNA or genomic DNA using a detecting 
means. 

23. The method according to claim 22 wherein the detecting means consists of a reporter 
molecule covalently attached to the probe or primer molecule. 

The method according to claim 22 wherein the detecting means consists of a 
polymerase chain reaction. 

The method according to claim 22 wherein the probe or primer comprises a nucleotide 
sequence selected from the group consisting of: 

(i) the nucleotide sequence set forth in SEQ ID NO: 25; 



24. 



25. 
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(ii) the nucleotide sequence set forth in SEQ ID NO: 26; 

(iii) the nucleotide sequence set forth in SEQ ID NO: 27; 

(iv) the nucleotide sequence set forth in SEQ ID NO: 28; 

(v) the nucleotide sequence set forth in SEQ ID NO: 29; 

(vi) the nucleotide sequence set forth in SEQ ID NO: 30; 

(vii) the nucleotide sequence set forth in SEQ ID NO: 31; 

(viii) the nucleotide sequence set forth in SEQ ID NO: 32; 

(ix) the nucleotide sequence set forth in SEQ ID NO: 33; 

(x) the nucleotide sequence set forth in SEQ ID NO: 34; 

(xi) a nucleotide sequence which encodes an amino acid sequence selected from 
the group consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

(g )AG SDFIIVPSIFEPCG LTQLVAM R YGS ; 

(h) TGGLVDTV; 

(i) KTGGLGDVAGA; 
(j) GHRVMWVPRY; 

(k) NDWHTALLPVYLKAYY; 
(I) GIVNGIDNMEWNPEVD; 
(m) DVPLLGFIGRLDGQKG; 
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(n) DVQLVMLGTG; 

(o)AGADALLMPSRF(EA/)PCGLNQLYAMAYGT; and 
(p)VGG(V/L)RDTV; 

(xii) a nucleotide sequence comprising at least about 15 contiguous nucleotides of 
an intron region of SEQ ID NO: 37; 

(xiii) a nucleotide sequence comprising at least about 15 contiguous nucleotides of 
an intron region of SEQ ID NO: 38; and 

(xiv) a nucleotide sequence which is complementary to any one of (i) to (xiii). 

26. A method of assaying for the presence or absence of a wheat starch synthase 
polypeptide in a plant or a plant extract or isolated nucleic acid sample, said method at 
least comprising performing the method according to any one of claims 22 to 25. 

27. The method according to claim 26 further comprising preparing the plant extract or 
nucleic acid sample. 

— V 

28. A method of marker-assisted breeding and/or selection of a plant at least comprising 
performing the method according to any one of claims 22 to 25. 

29. The method according to claim 28 further comprising selecting a plant which expresses 
a desirable wheat starch synthase characteristic. 

30. The method according to claim 28 further comprising crossing a plant which expresses 
a desirable wheat starch synthase characteristic to another plant. 

31 . The method according to claim 30 further comprising selecting progeny of the cross 
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which expresses a desirable wheat starch synthase characteristic. 

32. A plant produced by the method according to any one of claims 28 to 31 wherein said 
plant expresses a wheat starch synthase polypeptide at a desired level detectable using 
said method. 

33. A method of modifying the starch content and/or starch composition of one or more 
tissues or organs of a plant, comprising expressing in said plant a nucleic acid molecule 
for a time and under conditions sufficient for the en2yme activity of one or more starch 
synthase isoenzymes to be modified, wherein said nucleic acid molecule is selected 
from the group consisting of: 

(i) the isolated nucleic acid molecule according to any one of claims 1 to 9; 

(ii) a fragment of (i) which comprises a nucleotide sequence capable of being 
expressed to down-regulate the expression of an endogenous wheat starch synthase 
isoenzyme of said plant; and 

(iii) a fragment of (i) which encodes a functional wheat starch synthase isoenzyme 
of said plant. 

34. The method according to claim 33 wherein the fragment at sub-paragraph (ii) is an 
antisense molecule, ribozyme molecule, co-suppression molecule, or gene-targeting 
molecule. 

35. The method according to claim 33 further comprising introducing the nucleic acid 
molecule to an isolated plant cell, tissue, organ, or organelle. 

36. The method according to claim 35 further comprising regenerating an intact plant from 
the isolated plant cell, tissue, organ, or organelle carrying the introduced nucleic acid 
molecule. 
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37. The method according to claim 35 wherein the nucleic acid molecule is introduced to 
the plant cell, tissue, organ, or organelle by introgression. 



38- The method according to claim 35 wherein the nucleic acid molecule is introduced to 
the plant cell, tissue, organ, or organelle by transformation means. 



39. An isolated promoter sequence comprising a nucleotide sequence selected from the 
group consisting of: 

(i) nucleotides 1 to about 287 of SEQ ID NO: 1 1 ; 

(ii) nucleotides 1 to about 1416 of SEQ ID NO: 37; 

(iii) nucleotides 1 to about 973 of SEQ ID NO: 38; 

(iv) a fragment of any one of (i) to (iii) capable of conferring expression on a 
heterologous gene in a monocotyledonous plant cell, tissue or organ; and 

(v) a complementary nucleotide sequence to any one of (i) to (iv). 



40. The isolated promoter sequence according to claim 39 that is operable in the 
endosperm. 



41 . A plant carrying the isolated nucleic acid molecule according to any one of claims 1 to 
9 as an exogenous complement to its genome. 



42. A progeny of the plant according to claim 41 wherein said progeny carries the 
introduced nucleic acid molecule. 



A propagule of the plant according to claim 41 or 42 wherein said propagule carries the 
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introduced nucleic acid molecule present in said plant. 

A gene constnjct or vector which comprises the isolated nucleic acid molecule 
according to any one of claims 1 to 9 and one or more origins of replication. 

The gene constmct according to claim 44 further comprising a promoter 
sequence in operable connection with said isolated nucleic acid molecule. 

46. A gene construct or vector which comprises the probe or primer according to 
claim 10 or 1 1 and one or more origins of replication. 



44. 



45. 



A modified starch derived from the plant according to claim 32 or 41 wherein said starch 
is modified by virtue of the use of the isolated nucleic acid according to claim 1 to 
produce said plant. 



48. A modified starch derived from the progeny according to claim 42 wherein said starch 
is modified by virtue of the use of the isolated nucleic acid according to claim 1 to 
produce said progeny. 



49. A modified starch derived from the propagule according to claim 43 wherein said starch 
is modified by virtue of the use of the isolated nucleic acid according to claim 1 to 
produce said propagule. 



50. A food product comprising the modified starch according to any one of claims 47 to 49. 



51. 



The food product according to claim 50 consisting of flour or a flour-based food product. 
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52. The food product according to claim 50 or 51 selected from the group consisting of: 
flour-based sauce; leavened bread; unleavened bread; pasta, noodle; cereal; snack 
food; cake; and pastry. 



53. Use of the modified starch according to any one of claims 47 to 49 in the preparation 
of a food product for consumption by an animal or human. 



54. A modified protein derived from the plant according to claim 32 or 41 wherein said 
protein is modified by virtue of the use of the isolated nucleic acid according to claim 1 
to produce said plant. 



55. A modified protein derived from the progeny according to claim 42 wherein said protein 
is modified by virtue of the use of the isolated nucleic acid according to claim 1 to 
produce said progeny. 



56. A modified protein derived from the propagule according to claim 43 wherein said 
protein is modified by virtue of the use of the isolated nucleic acid according to claim 1 
to produce said propagule. 



57. A non-food product comprising the modified protein according to any one of claims 54 
to 56. 



58. The non-food product according to claim 57 selected from the group consisting of: films; . 
coatings; adhesives; building materials; and packaging materials. 



59. Use of the modified protein according to any one of claims 54 to 56 in the preparation 
of a non-food product 
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SEQUENCE LISTING 

^S^rilLoiK^l^^l^iS INDUSTRIAL RESEARCH ORGANISATION 

GROUPE LIMAGRAIN PACIFIC PTY LTD 

<120> NOVEL GENES ENCODING WHEAT STARCH SYNTHASES AND USES 
THEREFOR 

<130> p:\oper\inro\pi-wss.pct 

<140> TO BE ADVISED 
<141> 2000-04-28 

<150> AU PQ0052/99 
<151> 1999-04-29 

<160> 54 

<170> Patentin Ver. 2-0 

<210> 1 
<211> 2939 
<212> DNA 

<213> Triticum aestivum 

<220> 
<221> CDS 

<222> (176) (2569) 
<400> 1 

atttcctcgg cctgaccccg tgcgtttacc ccacacagag cacactccag tccagtccag 60 
cccactgccg cgctactccc cactcccact gccaccacct ccgcctgcgc cgcgctctgg 120 

gcggaccaac ccgcgcatcg tatcacgatc acccaccccg atcccggccg ccgcc atg 178 

Met 
1 

teg teg gcg gtc gcg tec gee geg tee ttc etc gcg etc gcg tec gee 226 
Ser Ser Ala Val Ala Ser Ala Ala Ser Phe Leu Ala Leu Ala Ser Ala 

5 10 15 

tec ecc ggg aga tea egg agg agg aeg agg gtg age gcg teg cea ece 27 4 
Sex Pro Gly Arg Ser Arg Arg Arg Thr Arg Val Ser Ala Ser Pro Pro 
20 25 30 

cac acc ggg get ggc agg ttg cac tgg ccg ecg teg ecg ecg cag cgc 322 
His Thr Gly Ala Gly Arg Leu His Trp Pro Pro Ser Pro Pro Gin Arg 
35 40 45 

aeg get cgc gae gga gcg gtg gee geg cgc gee gee ggg aag aag gae 370 
Thr Ala Arg Asp Gly Ala Val Ala Ala Arg Ala Ala Gly Lys Lys Asp 
SO 55 60 65 

gcg ggg ate gae gae gee gcg ecc geg agg cag eec cgc gea etc cgc 418 
Ala Gly lie Asp Asp Ala Ala Pro Ala Arg Gin Pro Arg Ala Leu Arg 

70 75 80 



ggt ggc gee gee acc aag gtt gcg gag egg agg gat ecc gtc aag aeg 
Gly Gly Ala Ala Thr Lys Val Ala Glu Arg Arg Asp Pro Val Lys Thr 

85 90 95 



466 



etc gat cgc gae gee geg gaa ggt ggc geg ccg tee ecg eeg gea ecg 514 
Leu Asp Arg Asp Ala Ala Glu Gly Gly Ala Pro Ser Pro Pro Ala Pro 



wo 00/66745 PCT/AUOO/00385 



-2- 



100 105 110 

agg cag gag gac gcc cgt ctg ccg age atg aac ggc atg ccg gtg aac 562 
Arg Gin Giu Asp Ala Arg Leu Pro Ser Met Asn Gly Met Pro Val Asn 
115 120 125 

* 

ggt gaa aac aaa tct acc ggc ggc ggc ggc gcg act aaa gac age ggg 610 
Gly Giu Asn Lys Ser Thr Gly Gly Gly Gly Ala Thr Lys Asp Ser Gly 
130 135 140 145 

ctg ccc gca ccc gca cgc gcg ccc cag ccg teg age cag aac aga gta 658 
Leu Pro Ala Pro Ala Arg Ala Pro Gin Pro Ser Ser Gin Asn Arg Val 

150 155 160 

ccg gtg aat ggt gaa aac aaa get aac gtc gcc teg ccg ccg acg age 706 
Pro Val Asn Gly Giu Asn Lys Ala Asn Val Ala Ser Pro Pro Thr Ser 

165 170 175 

ata gee gag gtc gcg get ccg gat ccc gca get ace att tec ate agt 754 
lie Ala Giu Val Ala Ala Pro Asp Pro Ala Ala Thr lie Ser lie Ser 
180 185 190 

gac aag gcg cea gag tec gtt gtc eca gee gag aag gcg ccg ccg teg 802 
Asp Lys Ala Pro Giu Ser Val Val Pro Ala Giu Lys Ala Pro Pro Ser 
195 200 205 

tec ggc tea aat ttc gtg ccc teg get tct get ccc ggg tct gac act 850 
Ser Gly Ser Asn Phe Val Pro Ser Ala Ser Ala Pro Gly Ser Asp Thr 
210 215 220 225 

gtc age gac gtg gaa ctt gaa ctg aag aag ggt gcg gtc att gtc aaa 898 
Val Ser Asp Val Giu Leu Giu Leu Lys Lys Gly Ala Val lie Val Lys 

230 235 240 

gaa get eca aac cea aag get ett teg ccg ccc gea gca ccc get gta 94 6 
Giu Ala Pro Asn Pro Lys Ala Leu Ser Pro Pro Ala Ala Pro Ala Val 

245 250 255 

eaa caa gae ett tgg gae ttc aag aaa tac att ggt ttc gag gag ccc 994 
Gin Gin Asp Leu Trp Asp Phe Lys Lys Tyr lie Gly Phe Giu Giu Pro 
260 265 270 

gtg gag gee aag gat gat ggc egg get gtt gca gat gat gcg ggc tec 1042 
Val Giu Ala Lys Asp Asp Gly Arg Ala Val Ala Aso Asp Ala Gly Ser 
275 280 285 

ttc gaa cac cac cag aat cac gat tec ggg cet ttg gea ggg gag aac 1090 
Phe Giu His His Gin Asn His Asp Ser Gly Pro Leu Ala Gly Giu Asn 
290 295 300 305 

gtc atg aac gtg gtc gtc gtg get get gaa tgt tct ccc tgg tgc aaa 1138 
Val Met Asn Val Val Val Val Ala Ala Giu Cys Ser Pro Trp Cys Lys 

310 315 320 

aca ggt ggt ett gga gat gtt gee ggt get ttg ccc aag get ttg gcg 1186 
Thr Gly Gly Leu Gly Asp Val Ala Gly Ala Leu Pro Lys Ala Leu Ala 

325 330 335 

aag aga gga eat cgt gtt atg gtt gtg gta cea agg tat ggg gac tat 1234 
Lys Arg Gly His Arg Val Met Val Val Val Pro Arg Tyr Gly Asp Tyr 
340 345 350 



gag gaa gcc tac gat gtc gga gtc cga aaa tac tac aag get get gga 
Giu Giu Ala Tyr Asp Val Gly Val Arg Lys Tyr Tyr Lys Ala Ala Gly 
355 360 365 



1282 
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cag gat atg gaa gtg aat tat ttc cat get tat ate gat gga gtt gat 1330 
Gin Asp Met Glu Val Asn Tyr Phe His Ala Tyr lie Asp Gly Val Asp 
370 375 380 385 

ttt gtg ttc att gac get cct etc ttc cga cac cgc cag gaa gac att 1378 
Phe Val Phe lie Asp Ala Pro Leu Phe Arg His Arg Gin Glu Asp lie 

390 395 400 

tat ggg ggc age aga cag gaa att atg aag cgc atg att ttg ttc tgc 1426 
Tyr Gly Gly Ser Arg Gin Glu lie Met Lys Arg Met lie Leu Phe Cys 

405 410 415 

aag gee get gtc gag gtt cea tgg cac gtt eca tgc ggc ggt gtc cct 1474 
Lys Ala Ala Val Glu Val Pro Trp His Val Pro Cys Gly Gly Val Pro 
420 425 430 

tat ggg gat gga aat ctg gtg ttt att gca aat gat tgg cac acg gca 1522 
Tyr Gly Asp Gly Asn Leu Val Phe lie Ala Asn Asp Trp His Thr Ala 
435 440 445 

etc etg cct gtc tat ctg aaa gca tat tac agg gac cat ggt ttg atg 1570 
Leu Leu Pro Val Tyr Leu Lys Ala Tyr Tyr Arg Asp His Gly Leu Met 
450 455 460 465 

cag tac act egg tec att atg gtg ata cat aac ate get cac cag ggc 1618 
Gin Tyr Thr Arg Ser lie Met Val lie His Asn lie Ala His Gin Gly 

470 475 480 

cgt ggc cca gta gat gag ttc ccg ttc acc gag ttg cct gag cac tac 1666 
Arg Gly Pro Val Asp Glu Phe Pro Phe Thr Glu Leu Pro Glu His Tyr 

485 490 495 

ctg gaa cac ttc aga ctg tac gac ccc gtg ggt ggt gaa cac gee aac 1714 
Leu Glu His Phe Arg Leu Tyr Asp Pro Val Gly Gly Glu His Ala Asn 
500 505 510 

tac ttc gee gee ggc ctg aag atg gcg gac cag gtt gtc gtc gtg age 17 62 

Tyr Phe Ala Ala Gly Leu Lys Met Ala Asp Gin Val Val Val Val Ser 

■ 515 520 525 

ccg ggg tac ctg tgg gag ctg aag acg gtg gag ggc ggc tgg ggg ctt 1810 

Pro Gly Tyr Leu Trp Glu Leu Lys Thr Val Glu Gly Gly Trp Gly Leu 

5.30 535 540 545 

cac gac ate ata egg cag aac gac tgg aag acc cgc ggc ate gtg aac 1858 
His Asp lie lie Arg Gin Asn Asp Trp Lys Thr Arg Gly lie Val Asn 

550 555 560 

ggc ate gac aac atg gag tgg aac ccc gag gtg gac gtc cac etc aag 1906 
Gly lie Asp Asn Met Glu Trp Asn Pro Glu Val Asp Val His Leu Lys 

565 570 575 

teg gac ggc tac acc aac ttc tec ctg ggg acg ctg gac tec ggc aag 1954 
Ser Asp Gly .Tyr Thr Asn Phe Ser Leu Gly Thr Leu Asp Ser Gly Lys 
580 . ■ S85 590 

• ■ 

egg cag tgc aag. gag gee etg cag egg gag etg ggc ctg cag gtc cgc 2002 

Arg Gin Cys Lys Glu Ala Leu Gin Arg Glu Leu Gly Leu Gin Val Arg 
5 95 *" 600 605 

ggc gac gtg ccg ctg etc ggc ttc ate ggg cgc ctg gac ggg cag aag 2050 
Gly Asp Val Pro Leu Leu Gly Phe lie Gly Arg Leu Asp Gly Gin Lys 
610 615 620 625 
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ggc gtg gag ate ate geg gac gcg atg ece tgg ate gtg age cag gac 
Gly Val Glu He He Ala Asp Ala Met Pro Trp He Val Ser Gin Asp 

635 640 



2098 



gtg cag ctg gtc atg ctg ggc aec ggg cgc cac gac ctg gag ggc atg 2146 
Val Gin Leu Val Met Leu Gly Thr Gly Arg His Asp Leu Glu Gly Met 

650 655 

etg egg cac ttc gag egg gag eae eac gac aag gtg cgc ggg tgg gtg 2194 
Leu Arg His Phe Glu Arg Glu His His Asp Lys Val Arg Gly Trp Val 
660 665 670 

ggg ttc tec gtg egg ctg gcg cac egg ate aeg gcc ggc gcc gac gcg 2242 
Gly Phe Ser Val . Arg Leu Ala His Arg He Thr Ala Gly Ala Asp Ala 
675 680 685 

cte etc atg ece tec egg ttc gag ecg tgc gga ctg aac cag etc tac 2290 
Leu Leu Met Pro Ser Arg Phe Glu Pro Cys Gly Leu Asn Gin Leu Tyr 

695 700 705 

gcc atg gcc tac ggc acc gtc ece gtc gtg cat gcc gtc ggt ggc etg 2338 
Ala Met Ala Tyr Gly Thr Val Pro Val Val His Ala Val Gly Gly Leu 

710 715 720 

agg gac acc gtg ccg ccg ttc gac ccc ttc aac eae tee ggg etc ggg 2386 
Arg Asp Thr Val Pro Pro Phe Asp Pro Phe Asn His Ser Gly Leu Glv 

725 730 735 

tgg acg ttc gac cgc gca gag geg cag aag ctg ate gag geg etc ggg 2434 
Trp Thr Phe Asp Arg Ala Glu Ala Gin Lys Leu He Glu Ala Leu Gly 
740 745 750 



eac tgc cte cgc acc tac egg gac tac aag gag age tgg agg ggg etc 
His Cys Leu Arg Thr Tyr Arg Asp Tyr Lys Glu Ser Trp Arg Gly Leu 
7S5 760 765 



2482 



cag gag cgc ggc atg teg cag gac ttc age tgg gag cat gcc gcc aag 2530 
Gin Glu Arg Gly Met Ser Gin Asp Phe Ser Trp Glu His Ala Ala Lys 
'^''O 775 780 / 785 



etc tac gag gac gtc etc gtc aag gcc aag tac cag tgg tgaaegetag 
Leu Tyr Glu Asp Val Leu Val Lys Ala Lys Tyr Gin Trp 

790 795 



2579 



ctgctagccg gtccagcccc geatgcgtgc atgacaggat ggaattgege attgcgeacg 2 639 
caggaaggtg ceatggageg eeggeatecg cgaagtacag tgacatgagg tgtgtgtggt 269 9 
tgagacgctg attccgatet ggteegtagc agagtagage ggaggtaggg aagegctcet 2759 
tgttacaggt atatgggaat gttgttaact tggtattgta atttgttatg ttgtgtgcat 2819 
tattacagag ggcaacgatc tgegceggcg caecggceca actgttgggc cggtcgcaca 287 9 
gcagccgttg gatccgaccg cctgggcegt tggatcccac cgaaaaaaaa aaaaaaaaaa 2939 

<210> 2 
<211> 798 
<212> PRT 

<213> Triticuin aestivum 
<400> 2 

Met Ser Ser Ala Val Ala Ser Ala Ala Ser Phe Leu Ala Leu Ala Ser 
1 5 . 10 15 
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Ala Ser Pro Gly 

20 



Pro His Thr Gly 
33 



Arg Thr Ala Arg 
50 



Asp Ala Gly He 

65 



Arg Gly Gly Ala 



Thr Leu Asp Arg 

100 



Pro Arg Gin Glu 
115 



Asn Gly Glu Asn 
130 

Gly Leu Pro Ala 
145 

Val Pro Val Asn 



Ser He Ala Glu 

180 



Ser Asp Lys Ala 
195 



Ser Ser Gly Ser 
210 



Thr Val Ser Asp 
225 



Lys Glu Ala Pro 



Val Gin Gin Asp 

2 60 



Pro Val Glu Ala 
275 



Ser Phe Glu Kis 
290 



Asn Val Met Asn 
305 



Lys Thr Gly Gly 



Ala Lys Arg Gly 

340 



Arg Ser Arg Arg 



Ala Gly Arg Leu 

40 

Asp Gly Ala Val 
55 

Asp Asp Ala Ala 
70 

Ala Thr Lys Val 
85 

Asp Ala Ala Glu 



Asp Ala Arg Leu 

120 

Lys Ser Thr Gly 
135 

Pro Ala Arg Ala 
150 

Gly Glu Asn Lys 
165 

Val Ala Ala Pro 



Pro Glu. Ser Val 

200 

Asn Phe Val Pro 
215 

Val Glu Leu Glu 
230 

Asn Pro Lys Ala 
245 

Leu Trp Asp Phe 



Lys Asp Asp Gly 

280 

His Gin Asn His 
295 

Val Val Val Val 
310 

Leu Gly Asp Val 
325 

His Arg Val Met 
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Arg Thr Arg Val 
25 

His Trp Pro Pro 



Ala Ala Arg Ala 

60 

Pro Ala Arg Gin 
75 

Ala Glu Arg Arg 
90 

Gly Gly Ala Pro 
105 

Pro Ser Met Asn 



Gly Gly Gly Ala 

[ 140 

Pro Gin Pro Ser 
155 

Ala Asn Val Ala 
170 

Asp Pro Ala Ala 
185 

Val Pro Ala Glu 



Ser Ala Ser Ala 

220 

Leu Lys Lys Gly 
235 

Leu Ser Pro Pro 
250 

Lys Lys Tyr He 
2 65 

Arg Ala Val Ala 



Asp Ser Gly Pro 

. 300 

Ala Ala Glu Cys 
315 

Ala Gly Ala Leu 
330 

Val Val Val Pro 
345 



Ser Ala Ser Pro 
30 

Ser Pro Pro Gin 
45 

Ala Gly Lys Lys 



Pro Arg Ala Leu 

80 

Asp Pro Val Lys 
95 

Ser Pro Pro Ala 
110 

Gly Met Pro Val 
125 

Thr Lys Asp Ser 



Ser Gin Asn Arg 

160 

Ser Pro Pro Thr 
175 

Thr lie Ser lie 
190 

Lys Ala Pro Pro 
205 

Pro Gly Ser Asp 



Ala Val He Val 

240 

Ala Ala Pro Ala 
255 

Gly Phe Glu Glu 
270 

Asp Asp Ala Gly 
285 

Leu Ala Gly Glu 



Ser Pro Trp Cys 

320 

Pro Lys Ala Leu 
335 

Arg Tyr Gly Asp 
350 
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Tyr Glu Glu Ala Tyr Asp Val Gly Val Arg Lys Tyr Tyr Lys Ala Ala 
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355 

Gly Gin Asp Met 
370 

Asp Phe Val Phe 
385 

lie Tyr Gly Gly 



Cys Lys Ala Ala 

420 

Pro Tyr Gly Asp 
435 

Ala Leu Leu Pro 
450 

Met Gin Tyr Thr 
4 65 

Gly Arg Gly Pro 



Tyr Leu Glu His 

500 

Asn Tyr Phe Ala 
515 

Ser Pro Gly Tyr 
530 

Leu His Asp lie 
545 

Asn Gly lie Asp 



Lys Ser Asp Gly 

580 

Lys Arg Gin Cys 
595 

Arg Gly Asp Va^ 
610 

Lys Gly Val Glu 
625 

Asp Val Gin Leu 



Met Leu Arg His 

660 

Val Gly Phe Ser 
675 

Ala Leu Leu Met 
690 



360 

Glu Val Asn Tyr 
375 

lie Asp Ala Pro 
390 

Ser Arg Gin Glu 
405 

Val Glu Val Pro 



Gly Asn Leu Val 

'440 

Val Tyr Leu Lys 
455 

Arg Ser lie Met 
470 

Val Asp Glu Phe 
485 

Phe Arg Leu Tyr 



Ala Gly Leu Lys 

520 

Leu Trp Glu Leu 
535 . 

lie Arg Gin Asn 

550 

Asn Met Glu Trp 
565 

Tyr Thr Asn Phe 



Lys Glu Ala Leu 

600 

Pro Leu Leu Gly 
615 

lie lie Ala Asp 
630 

Val Met Leu Gly 
645 

Phe Glu Arg Glu 



Val Arg Leu Ala 

680 

Pro Ser Arg Phe 
695 
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Phe His Ala Tyr 

380 

Leu Phe Arg His 
395 

lie Met Lys Arg 
410 

Trp His Val Pro 
425 

Phe lie Ala Asn 



Ala Tyr Tyr Arg 

460 

Val lie His Asn 
475 

Pro Phe Thr Glu 
4 90 

Asp Pro Val Gly 
505 • 

Met Ala Asp Gin 



Lys Thr Val Glu 

540 

Asp Trp Lys Thr 
555 

Asn Pro Glu Val 
570 

Ser Leu Gly Thr 
585 

Gin Arg Glu Leu 



Phe lie Gly Arg 

620 

Ala Met Pro Trp 
635 

Thr Gly Arg His 
650 

His His Asp Lys 
665 

His Arg lie Thr 



Glu Pro Cys Gly 

700 



365 

lie Asp Gly Val 



Arg Gin Glu Asp 

400 

Met lie Leu Phe 
415 

Cys Gly Gly Val 
430 

Asp Trp His Thr 
445 

Asp His Gly Leu 



lie Ala His Gin 

480 

Leu Pro Glu His 
4 95 

Gly Glu His Ala 
510 

Val Val Val Val 
525 

Gly Gly Trp Gly 



Arg Gly lie Val 

560 

Asp Val His Leu 
575 

Leu Asp Ser Gly 
590 

Gly Leu Gin Val 
605 

Leu Asp Gly Gin 



lie Val Ser Gin 

640 

Asp Leu Glu Gly 
655 

Val Arg Gly Trp 
670 

Ala Gly Ala Asp 
685 

Leu Asn Gin Leu 
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Tyr Ala Met Ala Tyr Gly Thr Val Pro Val Val His Ala Val Gly Gly 
705 710 715 720 

Leu Arg Asp Thr Val Pro Pro Phe Asp Pro Phe Asn His Ser Gly Leu 

725 730 735 

Gly Trp Thr Phe Asp Arg Ala Glu Ala Gin Lys Leu lie Glu Ala Leu 

740 745 750 

Gly His Cys Leu Arg Thr Tyr Arg Asp Tyr Lys Glu Ser Trp Arg Gly 
755 760 765 

Leu Gin Glu Arg Gly Met Ser Gin Asp Phe Ser Trp Glu His Ala Ala 
770 775 780 

Lys Leu Tyr Glu Asp Val Leu Val Lys Ala Lys Tyr Gin Trp 
785 790 795 

<210> 3 
<211> 2842 
<212> DNA 

<213> Triticum aestivum 

<220> 
<221> CDS 

<222> (89) . . (2485) 
<400> 3 

gctgccacca cctccgcctg cgccgcgctc tgggcggagg accaacccgc gcatcgtacc 60 

atcgcccgcc ccgatcccgg ccgccgcc atg teg teg geg gtc gcg tec gee 112 

Met Ser Ser Ala Val Ala Ser Ala 
1 5 

gcg tec ttc etc geg etc gee tee gee tec ecc ggg aga tea ege agg 160 
Ala Ser Phe Leu Ala Leu Ala Ser Ala Ser Pro Gly Arg Ser Arg Arg 
10 15 20 

egg geg agg gtg age gcg ccg cea ccc cae gee ggg gcc ggc agg ctg 208 
Arg Ala Arg Val Ser Ala Pro Pro Pro His Ala Gly Ala Gly Arg Leu 
25 30 35 40 

cac tgg cog ccg tgg ccg ccg cag cgc acg get cgc gac gga ggt gtg 256 
His Trp Pro Pro Trp Pro Pro Gin Arg Thr Ala Arg Asp Gly Gly Val 

45 50 55 

gcc gcg cgc gcc gcc ggg aag aag gac gcg agg gtc gac gac gae gee 304 
Ala Ala Arg Ala Ala Gly Lys Lys Asp Ala Arg Va\ Asp Asp Asp Ala 

60 65 70 

gcg tee geg agg cag ccc cgc gea cgc cgc ggt ggc gee gcc ace aag 352 
Ala Ser Ala Arg Gin Pro Arg Ala Arg Arg Gly Gly Ala Ala Thr Lys 
75 80 85 

gtc gcg gag egg agg gat ccc gtc aag acg etc gat cgc gac gcc gcg 400 
Val Ala Glu Arg Arg Asp Pro Val Lys Thr Leu Asp Arg Asp Ala Ala 
90 95 100 

gaa ggt ggc gcg ccg gca ccg ccg gca ccg agg cag gac gcc gcc cgt 448 
Glu Gly Gly Ala Pro Ala Pro Pro Ala Pro Arg Gin Asp Ala Ala Arg 

110 115 120 

cca ccg agt atg aac ggc acg ccg gtg aac ggt gag aac aaa tet ace 4 96 
Pro Pro Ser Met Asn Gly Thr Pro Val Asn Gly Glu Asn Lys Ser Thr 
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125 130 135 

ggc ggc ggc ggc gcg acc aaa gac age ggg ctg ccc gca ccc gca cgc 544 
Gly Gly Giy Gly Ala Thr Lys Asp Ser Gly Leu Pro Ala Pro Ala Arg 

140 145 150 

gcg ccc cat ccg teg acc cag aac aga gta oca gtg aac ggt gaa aac 592 
Ala Pro His Pro Ser Thr Gin Asn Arg Val Pro Val Asn Gly Glu Asn 
155 150 165 

aaa get aac gtc gcc teg ecg ccg acg age ata gee gag gte gtg get 640 
Lys Ala Asn Val Ala Ser Pro Pro Thr Ser lie Ala Glu Val Val Ala 
170 175 180 

ccg gat tec gca get acc att tec ate agt gac aag gcg ccg gag tec 688 
Pro Asp Ser Ala Ala Thr He Ser He Ser Asp Lys Ala Pro Glu Ser 
185 190 195 200 

gtt gtc cca gcc gag aag ccg ccg ccg teg tec ggc tea aat ttc gtg 736 
Val Val Pro Ala Glu Lys Pro Pro Pro Ser Ser Gly Ser Asn Phe Val 

205 210 -215 

gtc teg get tct get ccc agg ctg gac att gac age gat gtt gaa cet ' 784 
Val Ser Ala Ser Ala Pro Arg Leu Asp He Asp Ser Asp Val Glu Pro 

220 225 230 

gaa ctg aag aag ggt gcg gte ate gte gaa gaa get cca aac cca aag 832 
Glu Leu Lys Lys Gly Ala Val He Val Glu Glu Ala Pro Asn Pro Lys 
235 240 245 

get ett teg ccg cet gca gee ccc get gta caa gaa gac ett tgg gac 880 
Ala Leu Ser Pro Pro Ala Ala Pro Ala Val Gin Glu Asp Leu Trp Asp 
250 255 260 

ttc aag aaa tac att ggc ttc gag gag ccc gtg gag gcc aag gat gat 928 
Phe Lys Lys Tyr He Gly Phe Glu Glu Pro Val Glu Ala Lys Asp Asp 
265 270 275 280 

ggc tgg get gtt gca gat gat gcg ggc tec ttt gaa cat cac cag aac 976 
Gly Trp Ala Val Ala Asp Asp Ala Gly Ser Phe Glu His His Gin Asn 

285 290 295 ' 

cat gat tec gga cet ttg gca ggg gag aac gtc atg aac gtg gtc gtc 1024 
His Asp Ser Gly Pro Leu Ala Gly Glu Asn Val Met Asn Val Val Val 

300 305 310 

gtg get get gaa tgt tct cee tgg tgc aaa aca ggt ggt ett gga gat 1072 
Val Ala Ala Glu Cys Ser Pro Trp Cys Lys Thr Gly Gly Leu Gly Asp 
315 320 325 

gtt gcc ggt get ttg ccc aag get ttg gcg aag aga gga cat cgt gtt 1120 
Val Ala Gly Ala Leu Pro Lys Ala Leu Ala Lys Arg Gly His Arg Val 
330 335 340 

atg gtt gtg gta cca agg tat ggg gac tat gag gaa gcc tac gat gtc 1168 
Met Val Val Val Pro Arg Tyr Gly Asp Tyr Glu Glu Ala Tyr Asp Val 
345 350 355 360 

gga gtc cga aaa tac tac aag get get gga cag gat atg gaa gtg aat 1216 
Gly Val Arg Lys Tyr Tyr Lys Ala Ala Gly Gin Asp Met Glu Val Asn 

365 370 375 

tat ttc cat get tat ate gat gga gtt gat ttt gtg ttc att gac get 1264 
Tyr Phe His Ala Tyr He Asp Gly Val Asp Phe Val Phe He Asp Ala 

380 385 390 
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cct etc ttc cga cac cgc cag gaa gac att tat ggg ggc age aga cag 1312 
Pro Leu Phe Arg His Arg Gin Glu Asp lie Tyr Gly Gly Ser Arg Gin 
395 400 405 

gaa att atg aag cgc atg att ttg ttc tgc aag gcc get gtc gag gtt 1360 
Glu lie Met Lys Arg Met lie Leu Phe Cys Lys Ala Ala Val Glu Val 
410 415 420 

cct tgg cac gtt cca tgc ggc ggt gtc cct tat ggg gat gga aat ctg 1408 
Pro Trp His Val Pro Cys Gly Gly Val Pro Tyr Gly Asp Gly Asn Leu 
425 430 435 440 

gtg ttt att gca aat gat tgg cac acg gca etc ctg cct gtc tat ctg 14 56 
Val Phe lie Ala Asn Asp Trp His Thr Ala Leu Leu Pro Val Tyr Leu 

445 450 455 

aaa gca tat tac agg gac cat ggt ttg atg cag tac act egg tee att 1504 
Lys Ala Tyr Tyr Arg Asp His Gly Leu Met Gin Tyr Thr Arg Ser lie 

460 465 470 

atg gtg ata eat aae ate geg cac cag ggc egt ggc cca gta gat gaa 1552 
Met Val lie His Asn lie Ala His Gin Gly Arg Gly Pro Val Asp Glu 
475 480 485 

ttc eeg ttc ace gag ttg cct gag cac tac ctg gaa cac ttc aga ctg 1600 
Phe Pro Phe Thr Glu Leu Pro Glu His Tyr Leu Glu His Phe Arg Leu 
490 495 500 

tac gac ccc gtg ggt ggt gag cac gcc aae tac ttc gcc gcc ggc ctg 164 8 
Tyr Asp Pro Val Gly Gly Glu His Ala Asn Tyr Phe Ala Ala Gly Leu 
505 510 515 520 

aag atg geg gac cag gtt gtc gtg gtg age ccc ggg tac ctg tgg gag 1696 
Lys Met Ala Asp Gin Val Val Val Val Ser Pro Gly Tyr Leu Trp Glu 

525 530 ■ 535 

etc aag acg gtg gag ggc ggc tgg ggg ett eae gac ate ata egg eag 1744 
Leu Lys Thr Val Glu Gly Gly Trp Gly Leu His Asp lie lie Arg Gin 

540 545 550 

aae gac tgg aag ace cgc ggc ate gtc aae ggc ate gac aae atg gag 1792 
Asn Asp Trp Lys Thr Arg Gly He Val Asn Gly He Asp Asn Met Glu 
555 560 565 

tgg aac ccc gag gtg gac gtc cac etc aag teg gac ggc tac ace aae 1840 
Trp Asn Pro Glu Val Asp Val His Leu Lys Ser Asp Gly Tyr Thr Asn 
570 575 580 

ttc tec ctg ggg acg ctg gac tec ggc aag egg eag tgc aag gag gee 1888 
Phe Ser Leu Gly Thr Leu Asp Ser Gly Lys Arg Gin Cys Lys Glu Ala 
585 590 595 600 

ctg cag cgc gag ctg ggc ctg eag gtc cgc gcc gac gtg eeg ctg etc 1936 
Leu Gin Arg Glu Leu Gly Leu Gin Val Arg Ala Asp Val Pro Leu Leu 

605 610 615 

ggc ttc ate ggc cgc ctg gac ggg cag aag ggc gtg gag ate ate geg 1984 
Gly Phe He Gly Arg Leu Asp Gly Gin Lys Gly Val Glu He He Ala 

620 625 630 



gac gcc atg ccc tgg ate gtg age cag gac gtg cag ctg gtc atg ctg 
Asp Ala Met Pro Trp He Val Ser Gin Asp Val Gin Leu Val Met Leu 
635 640 645 



2032 
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ggc acc ggc cgc cac gac ctg gag age atg ctg egg cac ttc gag egg 2080 
Gly Thr Gly Arg His Asp Leu Glu Ser Met Leu Arg His Phe Glu Arg 
650 655 660 

gag cac cac gac aag gtg cgc ggg tgg gtg ggg ttc tec gtg cgc ctg 2128 
Glu His His Asp Lys Val Arg Gly Trp Val Gly Phe Ser Val Arg Leu 
665 670 675 680 

gcg cac egg ate acg gcg ggc gee gac geg etc etc atg cec tec egg 2176 
Ala His Arg lie Thr Ala Gly Ala Asp Ala Leu Leu Met Pro Ser Arg 

685 690 695 

ttc gag ccg tge ggg ttg aac eag ctt tac gee atg gee tac ggc acc 2224 
Phe Glu Pro Cys Gly Leu Asn Gin Leu Tyr Ala Met Ala Tyr Gly Thr 

700 705 710 

gtc ccc gtc gtg cac gcc gtc ggc ggg gtg agg gac acc gtg ccg ccg 2272 
Val Pro Val Val His Ala Val Gly Gly Val Arg Asp Thr Val Pro Pro 
715 720 725 

ttc gac ccc ttc aac cac tec ggc etc ggg tgg acg ttc gac cgc gcc 2320 
Phe Asp Pro Phe Asn His Ser Gly Leu Gly Trp Thr Phe Asp Arg Ala 
730 735 740 

gag gcg cac aag ctg ate gag geg etc ggg cac tge etc cgc ace tac 2368 
Glu Ala His Lys Leu lie Glu Ala Leu Gly His Cys Leu Arg Thr Tyr 
745 750 755 760 

egg gae tac aag gag age tgg agg ggc etc eag gag cgc ggc atg teg 2416 
Arg Asp Tyr Lys Glu Ser Trp Arg Gly Leu Gin Glu Arg Gly Met Ser 

765 770 775 

eag gae ttc age tgg gag eat gee gcc aag etc tac gag gac gtc etc 2464 
Gin Asp Phe Ser Trp Glu His Ala Ala Lys Leu Tyr Glu Asp Val Leu 

780 785 790 

etc aag gcc aag tac cag tgg tgaacgctag ctgctagccg ctccagcccc 2515 
Leu Lys Ala Lys Tyr Gin Trp 
795 

gcatgcgtgc atgcatgaga gggtggaact gcgcattgcg cccgcaggaa cgtgccatcc 2575 

ttctcgatgg gagcgccggc atcegegagg tgcagtgaca tgagaggtgt gtgtggttga 2635 

gacgctgatt ccgatctcga tctggtccgt agcagagtag ageggaegta gggaageget 2695 

ccttgttgca ggtatatggg aatgttgtea acttggtatt gtagtttget atgttgtatg 2755 

cgttattaca atgttgttac ttattcttgt taagtcggag gcaaagggcg aaagetagct 2815 

caeatgaaaa aaaaaaaaaa aaaaaaa 2842 



<210> 4 
<211> 799 
<212> PRT 

<213> Triticum aestivum 
<400> 4 

Met Ser Ser Ala Val Ala Ser Ala Ala Ser Phe Leu Ala Leu Ala Ser 
15 10 15 



Ala Ser Pro Gly Arg Ser Arg Arg Arg Ala Arg Val Ser Ala Pro Pro 

20 25 30 
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Pro His Ala Gly Ala Giy Arg Leu His Trp Pro Pro Trp Pro Pro Gin 
35 40 45 

Arg Thr Ala Arg Asp Gly Gly Val Ala Ala Arg Ala Ala Gly Lys Lvs 
50 55 60 

■ Asp Ala Arg Val Asp Asp Asp Ala Ala Ser Ala Arg Gin Pro Arg Ala 

70 75 80 

Arg Arg Gly Gly Ala Ala Thr Lys Val Ala Glu Arg Arg Asp Pro Val 

85 90 95 

Lys Thr Leu Asp Arg Asp Ala Ala Glu Gly Gly Ala Pro Ala Pro Pro 

100 105 no 

Ala Pro Arg Gin Asp Ala Ala Arg Pro Pro Ser Met Asn Gly Thr Pro 
115 120 125 

Val Asn Gly Glu Asn Lys Ser Thr Gly Gly Gly Gly Ala Thr Lys Asp 
130 135 140 

Ser Gly Leu Pro Ala Pro Ala Arg Ala Pro His Pro Ser Thr Gin Asn 

150 155 160 

Arg Val Pro Val Asn Gly Glu Asn Lys Ala Asn Val Ala Ser Pro Pro 

165 170 175 

Thr Ser lie Ala Glu Val Val Ala Pro Asp Ser Ala Ala Thr lie Ser 

180 185 . 190 

lie Ser Asp Lys Ala Pro Glu Ser Val Val Pro Ala Glu Lys Pro Pro 
195 200 205 

Pro Ser Ser Gly Ser Asn.Phe Val Val Ser Ala Ser Ala Pro Arg Leu 
210 215 220 

Asp lie Asp Ser Asp Val Glu Pro Glu Leu Lys Lys Gly Ala Val lie 
225 230 235 240 

Val Glu Glu Ala Pro Asn Pro Lys Ala Leu Ser Pro Pro Ala Ala Pro 

245 250 255 

Ala Val Gin Glu Asp Leu Trp Asp Phe Lys Lys Tyr He Gly Phe Glu 

260 265 270 

Glu Pro Val Glu Ala Lys Asp Asp Gly Trp Ala Val Ala Asp Asp Ala 
275 280 285 

Gly Ser Phe Glu His His Gin Asn His Asp Ser Gly Pro Leu Ala Gly 
290 295 300 

Glu Asn Val Met Asn Val Val Val Val Ala Ala Glu Cys Ser Pro Trp 
305 310 315 320 

Cys Lys Thr Gly Gly Leu Gly Asp Val Ala Gly Ala Leu Pro Lys Ala 

325 330 335 

Leu Ala Lys Arg Gly His Arg Val Met Val Val Val Pro Arg Tyr Gly 

340 345 350 

Asp Tyr Glu Glu Ala Tyr Asp Val Gly Val Arg Lys Tyr Tyr Lys Ala 
355 360 365 

Ala Gly Gin Asp Met Glu Val Asn Tyr Phe His Ala Tyr lie Asp Gly 
370 375 380 
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Val Asp Phe Val Phe lie Asp Ala Pro Leu Phe Arg His Arg Gin Glu 

390 395 400 

Asp He Tyr Gly Gly Ser Arg Gin Glu He Met Lys Arg Met He Leu 

405 410 415 

Phe Cys Lys Ala Ala Val Glu Val Pro Trp His Val Pro Cys Gly Gly 

420 425 430 

Val Pro Tyr Gly Asp Gly Asn Leu Val Phe He Ala Asn Asp Trp His 
435 440 445 

Thr Ala Leu Leu Pro Val Tyr Leu Lys Ala Tyr Tyr Arg Asp His Gly 
450 455 460 

Leu Met Gin Tyr Thr Arg Ser He Met Val He His Asn He Ala His 

470 475 480 

Gin Gly Arg Gly Pro Val Asp Glu Phe Pro Phe Thr Glu Leu Pro Glu 

■ 485 490 495 

His Tyr Leu Glu His Phe Arg Leu Tyr Asp Pro Val Gly Gly Glu His 

500 505 510 

Ala Asn Tyr Phe Ala Ala Gly Leu Lys Met Ala Asp Gin Val Val Val 
515 520 525 

Val Ser Pro Gly Tyr Leu Trp Glu Leu Lys Thr Val Glu Gly Gly Trp 
530 535 540 

Gly Leu His Asp He He Arg Gin Asn Asp Trp Lys Thr Arg Gly He 
545 550 555 560 

Val Asn Gly He Asp Asn Met Glu Trp Asn Pro Glu Val Asp Val His 

565 570 575 

Leu Lys Ser Asp Gly Tyr Thr Asn Phe Ser Leu Gly Thr Leu Asp Ser 

580 585 590 

Gly Lys Arg Gin Cys Lys Glu Ala Leu Gin Arg Glu Leu Gly Leu Gin 
595 600 605 

Val Arg Ala Asp Val Pro Leu Leu Gly Phe He Gly Arg Leu Asp Gly 
610 615 620 

Gin Lys Gly Val Glu He He Ala Asp Ala Met Pro Trp He Val Ser 
625 630 635 640 

Gin Asp Val Gin Leu Val Met Leu Gly Thr Gly Arg His Asp Leu Glu 

645 650 655 

Ser Met Leu Arg His Phe Glu Arg Glu His His Asp Lys Val Arg Gly 

660 665 670 

Trp Val Gly Phe Ser Val Arg Leu Ala His Arg He Thr Ala Gly Ala 
675 680 685 

Asp Ala Leu Leu Met Pro Ser Arg Phe Glu Pro Cys Gly Leu Asn Gin 
690 695 700 

Leu Tyr Ala Met Ala Tyr Gly Thr Val Pro Val Val His Ala Val Gly 
705 710 715 720 

Gly Val Arg Asp Thr Val Pro Pro Phe Asp Pro Phe Asn His Ser Gly 
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725 730 735 

Leu Gly Trp Thr Phe Asp Arg Ala Glu Ala His Lys Leu He Glu Ala 

740 745 750 

Leu Gly His Cys Leu Arg Thr Tyr Arg Asp Tyr Lys Glu Ser Trp Arg 
755 760 765 

Gly Leu Gin Glu Arg Gly Met Ser Gin Asp Phe Ser Trp Glu His Ala 
770 775 780 

Ala Lys Leu Tyr Glu Asp Val Leu Leu Lys Ala Lys Tyr Gin Trp 
785 790 795 



<210> 5 

<211> 2107 

<212> DNA 

<213> Triticum aestivum 

<220> 

<221> CDS 

<222> (1) . . (1791) 

<400> 5 

cca get gag aag acg ccg ccg teg tec gge tea aat ttc gag tec teg 4 8 

Pro Ala Glu Lys Thr Pro Pro Ser Ser Gly Ser Asa Phe Glu Ser Ser 

1 5 .10 .15 

gee tet get eec ggg tet gac aet gte age gac gtg gaa caa gaa ctg 96 
Ala Ser Ala Pro Gly Ser Asp Thr Val Ser Asp Val Glu Gin Glu Leu 

20 25 30 

aag aag ggt gcg gtc gtt gte gaa gaa get cca aag cca aag get ctt 14 4 
Lys Lys Gly Ala Val Val Val Glu Glu Ala Pro Lys Pro Lys Ala Leu 
35 40 45 

teg ccg cct gca gcc ccc get gta caa gaa gac ctt tgg gat ttc aag 192 
Ser Pro Pro Ala Ala Pro Ala Val Gin Glu Asp Leu Trp Asp Phe Lys 
50 55 60 

aaa tac att ggt ttc gag gag ccc gtg gag gcc aag gat gat gge egg 240 
Lys Tyr He Gly Phe Glu Glu Pro Val Glu Ala Lys Asp Asp Gly Arg 
65 70 75 80 

get gtc gca gat gat gcg gge tee ttt gaa eac cae eag aat eae gae 288 
Ala Val Ala Asp Asp Ala Gly Ser Phe Glu His His Gin Asn His Asp 

85 90 95 

tee gga eet ttg gca ggg gag aat gte atg aac gtg gte gte gtg get 336 
Ser Gly Pro Leu Ala Gly Glu Asn Val Met Asn Val Val Val Val Ala 

100 105 110 

get gag tgt tet ccc tgg tgc aaa aca ggt ggt ctg gga gat gtt gcg 384 
Ala Glu Cys Ser Pro Trp Cys Lys Thr Gly Gly Leu Gly Asp Val Ala 
115 120 125 

ggt get ctg ccc aag get ttg gca aag aga gga cat cgt gtt atg gtt 432 
Gly Ala Leu Pro Lys Ala Leu Ala Lys Arg Gly His Arg Val Met Val 
130 135 140 

gtg gta cca agg tat ggg gac tat gaa gaa ect acg gat gtc gga gte 480 
Val Val Pro Arg Tyr Gly Asp Tyr Glu Glu Pro Thr Asp Val Gly Val 
145 150 155 160 
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cga aaa tac tac aag get get gga cag gat atg gaa gtg aat tat ttc 328 
Arg Lys Tyr Tyr Lys Ala Ala Gly Gin Asp Met Glu Val Asn Tyr Phe 

165 170 175 

cat get tat ate gat gga gtt gat ttt gtg ttc att gac get cct etc 576 
His Ala Tyr lie Asp Gly Val Asp Phe Val Phe lie Asp Ala Pro Leu 

180 185 190 

ttc cga cac cga gag gaa gac att tat ggg ggc age aga cag gaa att 624 
Phe Arg His Arg Glu Glu Asp lie Tyr Gly Gly Ser Arg Gin Glu lie 
195 200 205 

atg aag cge atg att ttg ttc tgc aag gcc get gtt gag gtt cea tgg 672 
Met Lys Arg Met lie Leu Phe Cys Lys Ala Ala Val Glu Val Pro Trp 
210 215 220 

eac gtt cca tgc ggc ggt gte cct tat ggg gat gga aat etg gtg ttt 720 
His Val Pro Cys Gly Gly Val Pro Tyr Gly Asp Gly Asn Leu Val Phe 
225 230 235 240 

att gca aat gat tgg cae aeg gea etc etg cct gte tat etg aaa gca 768 
lie Ala Asn Asp Trp His Thr Ala Leu Leu Pro Val Tyr Leu Lys Ala 

245 250 255 



tat tac agg gac cat ggt ttg atg cag tac act egg tec att atg gtg 
Tyr Tyr Arg Asp His Gly Leu Met Gin Tyr Thr Arg Ser lie Met Val 

260 265 270 



816 



ata cat aae ate get eae cag ggc cgt ggc cct gta gat gaa ttc ceg 8 64 
lie His Asn lie Ala His Gin Gly Arg Gly Pro Val Asp Glu Phe Pro 
275 280 285 

ttc ace gag ttg cct gag cae tac etg gaa cae ttc aga etg tac gac 912 
Phe Thr Glu Leu Pro Glu His Tyr Leu Glu His Phe Arg Leu Tyr Asp 
290 295 300 

cec gtg ggt ggt gaa cae gee aae tac ttc gcc gee ggc etg aag atg 960 
Pro Val Gly Gly Glu His Ala Asn Tyr Phe Ala Ala Gly Leu Lys Met 
305 310 315 320 

gcg gac cag gtt gte gtg gtg age ccc ggg tac etg tgg gag etg aag 1008 
Ala Asp Gin Val Val Val Val Ser Pro Gly Tyr Leu Trp Glu Leu Lys 

325 330 335 

acg gtg gag ggc ggc tgg ggg ctt cac gac ate ata egg cag aae gac 1056 
Thr Val Glu Gly Gly Trp Gly Leu His Asp lie He Arg Gin Asn Asp 

340 345 350 

tgg aag ace egc ggc ate gte aae ggc ate gac aae atg gag tgg aae 1104 
Trp Lys Thr Arg Gly He Val Asn Gly He Asp Asn Met Glu Trp Asn 
355 360 365 

eee gag gtg gac gcc eac etc aag teg gac ggc tac ace aae ttc tee 1152 
Pro Glu Val Asp Ala His Leu Lys Ser Asp Gly Tyr Thr Asn Phe Ser 
370 375 380 

etg agg acg etg gac tec ggc aag egg cag tgc aag gag gcc etg cag 1200 
Leu Arg Thr Leu Asp Ser Gly Lys Arg Gin Cys Lys Glu Ala Leu Gin 
385 390 395 400 

cgc gag etg ggc etg cag gte cge gcc gac gtg ccg etg etc ggc ttc 1248 
Arg Glu Leu Gly Leu Gin Val Arg Ala Asp Val Pro Leu Leu Gly Phe 

405 410 415 

ate ggc egc etg gac ggg cag aag ggc gtg gag ate ate gcg gac gee 1296 
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He Gly Arg Leu Asp Gly Gin Lys Gly Val Glu He He Ala Asp Ala 

420 425 430 

atg ccc tgg ate gtg age cag gac gtg cag ctg gtg atg ctg ggc acc 1344 
Met Pro Trp He Val Ser Gin Asp Val Gin Leu Val Met Leu Gly Thr 
435 440 445 

ggg cgc cac gac ctg gag age atg ctg cag cae ttc gag egg gag cac 1392 
Gly Arg His Asp Leu Glu Ser Met Leu Gin His Phe Glu Arg Glu His 
430 455 460 

cac gac aag gtg cgc ggg tgg gtg ggg ttc tec gtg cgc ctg gcg cae 14 40 
His Asp Lys Val Arg Gly Trp Val Gly Phe Ser Val Arg Leu Ala His 

470 475 480 

egg ate acg gcg ggg gcg gac gcg etc etc atg ccc tec egg ttc gtg 1488 
Arg He Thr Ala Gly Ala Asp Ala Leu Leu Met Pro Ser Arg Phe Val 

485 490 495 

ecg tgc ggg ctg aac cag etc tac gee atg gee tac ggc ace gtc ccc' 1536 
Pro Cys Gly Leu Asn Gin Leu Tyr Ala Met Ala Tyr Gly Thr Val Pro 

500 505 510 

gtc gtg cac gee gtc ggc ggc etc agg gac acc gtg ccg ccg ttc gac 1584 
Val Val His Ala Val Gly Gly Leu Arg Asp Thr Val Pro Pro Phe Asp 
515 520 525 

ccc ttc aac cac tec ggg etc ggg tgg acg ttc gac cgc gee gag gcg 1632 
Pro Phe Asn His Ser Gly Leu Gly Trp Thr Phe Asp Arg Ala Glu Ala 
530 535 540 

cac aag ctg ate gag gcg etc ggg cac tgc etc cgc acc tac cga gac 1680 
His Lys Leu He Glu Ala Leu Gly His Cys Leu Arg- Thr Tyr Arg Asp 
545 550 555 560 



ttc aag gag age tgg agg gee etc cag gag cgc ggc atg teg cag gac 
Phe Lys Glu Ser Trp Arg Ala Leu Gin Glu Arg Gly Met Ser Gin Asp 

565 570 575 



1728 



ttc age tgg gag cac gee gee aag ete tac gag gac gtc etc gtc aag 1776 
Phe Ser Trp Glu His Ala Ala Lys Leu Tyr Glu Asp Val Leu Val Lys 

580 585 590 

gee aag tac cag tgg tgaacgetag ctgctagccg ctecagccce gcatgcgtgc 1831 
Ala Lys Tyr Gin Trp 
595 

atgacaggat ggaactgcat tgegcacgea ggaaagtgcc atggagcgce ggcatccgeg 1891 
aagtacagtg acatgaggtg tgtgtggttg agacgetgat tecaatccgg cccgtageag 1951 
agtagagegg aggtatatgg gaatcttaac ttggtattgt aatttgttat gttgtgtgca 2011 
ttattacaat gttgttactt attcttgtta agteggagge caagggegaa agetagctea 2071 
catgtctgat ggatgcaaaa aaaaaaaaaa aaaaaa 2107 

<210> 6 
<211> 597 
<212> PR? 

<213> Triticum aestivum 
<400> 6 

Pro Ala Glu Lys Thr Pro Pro Ser Ser Gly Ser Asn Phe Glu Ser Ser 
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1 



Ala Ser Ala Pro 

20 



Lys Lys Gly Ala 
35 



Ser Pro Pro Ala 
50 



Lys Tyr lie Gly 
65 



Ala Val Ala Asp 



Ser Gly Pro Leu 

100 



Ala Glu Cys Ser 
115 



Gly Ala Leu Pro 
130 



Val Val Pro Arg 
145 



Arg Lys Tyr Tyr 



His Ala Tyr lie 

180 

Phe Arg His Arg 
195 



Met Lys Arg Met 
210 



His Val Pro Cys 
225 



lie Ala Asn Asp 



Tyr Tyr Arg Asp 

2 60 

He His Asn He 
275 

Phe Thr Glu Leu 
290 



Pro Val Gly Gly 
305 



Ala Asp Gin Val 



Thr Val Glu Gly 

340 



5 



Gly Ser Asp Thr 



Val Val Val Glu 

40 



Ala Pro Ala Val 
55 



Phe Glu Glu Pro 
70 



Asp Ala Gly Ser 
85 



Ala Gly Glu Asn 



Pro Trp Cys Lys 

120 

Lys Ala Leu Ala 
135 

Tyr Gly Asp Tyr 
150 

Lys Ala Ala Gly 
165 



Asp Gly Val Asp 



Glu Glu Asp He 

200 

He Leu Phe Cys 
215 



Gly Gly Val Pro 
230 

Trp His Thr Ala 
245 



His Gly Leu Met 



Ala His Gin Gly 

280 

Pro Glu His Tyr 
295 

Glu His Ala Asn 
310 

Val Val Val Ser 
325 

Gly Trp Gly Leu 



- 16- 



10 

Val Ser Asp Val 
25 

Glu Ala Pro Lys 



Gin Glu Asp Leu 

60 

Val Glu Ala Lys 
75 

Phe Glu His His 
90 

Val Met Asn Val 
105 

Thr. Gly Gly Leu 



Lys Arg Gly His 

140 

Glu Glu Pro Thr 
155 

Gin Asp Met Glu 
170 

Phe Val Phe He 
185 

Tyr Gly Gly Ser 



Lys Ala Ala Val 

220 

Tyr Gly Asp Gly 
235 

Leu Leu Pro Val 
250 

Gin Tyr Thr Arg 
2 65 

Arg Gly Pro Val 



Leu Glu His Phe 

300 

Tyr Phe Ala Ala 
315 

Pro Gly Tyr Leu 
330 

His Asp lie He 
345 



15 

Glu Gin Glu Leu 
30 



Pro Lys Ala Leu 
45 



Trp Asp Phe Lys 



Asp Asp Gly Arg 

80 

Gin Asn His Asp 
95 

Val Val Val Ala 
110 

Gly Asp Val Ala 
125 . 

Arg Val Met Val 



Asp Val Gly Val 

160 

Val Asn Tyr Phe 
175 

Asp Ala Pro Leu 
190 

Arg Gin Glu He 
205 

Glu Val Pro Trp 



Asn Leu Val Phe 

240 

Tyr Leu Lys Ala 
255 

Ser He Met Val 
270 

Asp Glu Phe Pro 
285 



Arg Leu Tyr Asp 



Gly Leu Lys Met 

320 

Trp Glu Leu Lys 
335 



Arg Gin Asn Asp 
350 



wo 00/66745 



PCT/AUOO/00385 



- 17- 



Trp Lys Thr Arg Gly He Val Asn Giy He Asp Asn Met Glu Trp Asn 
355 360 365 

Pro Glu Val Asp Ala His Leu Lys Ser Asp Gly Tyr Thr Asn Phe Ser 
370 375 380 

Leu Arg Thr Leu Asp Ser Gly Lys Arg Gin Cys Lys Glu Ala Leu Gin 
385 390 395 400 

Arg Glu Leu Gly Leu Gin Val Arg Ala Asp Val Pro Leu Leu Gly Phe 

405 410 415 

He Gly Arg Leu Asp Gly Gin Lys Gly Val Glu He He Ala Asp Ala 

420 425 430 

Met Pro Trp He Val Ser Gin Asp Val Gin I^u Val Met Leu Gly Thr 
435 440 445 

Gly Arg His Asp Leu Glu Ser Met Leu Gin His Phe Glu Arg Glu His 
450 455 460 

His Asp Lys Val Arg Gly Trp Val Gly Phe Ser Val Arg Leu Ala His 
465 470 475 480 

Arg He Thr Ala Gly Ala Asp Ala Leu Leu Met Pro Ser Arg Phe Val 

485 490 495 

Pro Cys Gly Leu Asn Gin Leu Tyr Ala Met Ala Tyr Gly Thr Val Pro 

500 505 510 

Val Val His Ala Val Gly Gly Leu Arg Asp Thr Val Pro Pro Phe Asp 
515 520 525 

Pro Phe Asn His Ser Gly Leu Gly Trp Thr Phe Asp Arg Ala Glu Ala 
530 535 540 

His Lys Leu He Glu Ala Leu Gly His Cys Leu Arg Thr Tyr Arg Asp 
545 550 555 560 

Phe Lys Glu Ser Trp Arg Ala Leu Gin Glu Arg Gly Met Ser Gin Asp 

565 570 575 

Phe Ser Trp Glu His Ala Ala Lys Leu Tyr Glu Asp Val Leu Val Lys 

580 585 590 

Ala Lys Tyr Gin Trp 
595 



<210> 7 

<211> 5346 

<212> DNA 

<213> Triticum aestivum 

<220> 
<221> CDS 

<222> (29) . . (4912) 
<400> 7 

cggcacgagg tttagtaggt tccgggaa atg gag atg tct etc tgg cca egg 52 

Met Glu Met Ser Leu Trp Pro Arg 
1 5 

age ccc ctg tgc cct egg age agg cag cog etc gtc gtc gtc egg ccg 100 
Ser Pro Leu Cys Pro Arg Ser Arg Gin Pro Leu Val Val Val Arg Pro 



i 

I 
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10 15 20 

gcc ggc cgc ggc ggc etc acg cag cct ttt ttg atg aat ggc aga ttt 148 
Ala Gly Arg Gly Gly Leu Thr Gin Pro Phe Leu Met Asn Gly Arg Phe 
25 30 35 40 

act cga age agg acc ctt cga tgc atg gta gca agt tea gat cct cct 196 
Thr Arg Ser Arg Thr Leu Arg Cys Met Val Ala Ser Ser Asp Pro Pro 

45 50 55 

aat agg aaa tea aga agg atg gta cca cct cag gtt aaa gtc att tct 244 
Asn Arg Lys Ser Arg Arg Met Val Pro Pro Gin Val Lys Val lie Ser 

60 65 70 

tct aga gga tat acg aca aga etc att gtt gaa eca age aae gag aat 292 
Ser Arg Gly Tyr Thr Thr Arg Leu He Val Glu Pro Ser Asn Glu Asn 
75 80 85 

aca gaa cac aat aat egg gat gaa gaa act ctt gat aca tac aat gcg 340 
Thr Glu His Asn Asn Arg Asp Glu Glu Thr Leu Asp Thr Tyr Asn Ala 
90 95 100 

eta tta agt acc gag aca gca gaa tgg aca gat aat aga gaa gcc gag 388 
Leu Leu Ser Thr Glu Thr Ala Glu Trp Thr Asp Asn Arg Glu Ala Glu 
105 110 115 120 

act get aaa gcg gac teg teg caa aat get tta age agt tct ata att 436 
Thr Ala Lys Ala Asp Ser Ser Gin Asn Ala Leu Ser Ser Ser He He 

125 130 135 

ggg gaa gtg gat gtg gcg gat gaa gat ata ett gcg get gat etg aca 4 84 
Gly Glu Val Asp Val Ala Asp Glu Asp He Leu Ala Ala Asp Leu Thr 

140 145 150 

gtg tat tea ttg age agt gta atg aag aag gaa gtg gat gca gcg gac 532 
Val Tyr Ser Leu Ser Ser Val Met Lys Lys Glu Val Asp Ala Ala Asp 
155 160 165 

aaa get aga gtt aaa gaa gac gca ttt gag etg gat ttg cca gca act 580 
Lys Ala Arg Val Lys Glu Asp Ala Phe Glu Leu Asp Leu Pro Ala Thr 
170 175 180 



aca ttg aga agt gtg ata gta gat gtg atg gat cat aat ggg act gta 
Thr Leu Arg Ser Val He Val Asp Val Met Asp His Asn Gly Thr Val 

IOC irtrt trtc 



628 



caa gag aca ttg aga agt gtg ata gta gat gtg atg gat cat aat ggg 676 

Gin Glu Thr Le'i Arg Ser Val He Val Asp Val Met Asp His Asn Gly 

205 210 215 

act gta caa gag aca ttg aga agt gtg ata gta gat gtg atg gat gat 724 

Thr Val Gin Glu Thr Leu Arg Ser Val He Val Asp Val Met Asp Asp 

220 225 230 

gcg gcg gac aaa get aga gtt gaa gaa gac gta ttt gag etg gat ttg 772 

Ala Ala Asp Lys Ala Arg Val Glu Glu Asp Val Phe Glu Leu Asp Leu 

235 240 245 

tea gga aat att tea age agt gcg acg acc gtg gaa eta gat gcg gtt 820 

Ser Gly Asn He Ser Ser Ser Ala Thr Thr Val Glu Leu Asp Ala Val 

250 255 ■ 260 

gac gaa gtc ggg cct gtt caa gac aaa ttt gag gcg acc tea tea gga 868 

Asp Glu Val Gly Pro Val Gin Asp Lys Phe Glu Ala Thr Ser Ser Gly 

265 270 275 280 



wo 00/66745 



PCT/AUOO/00385 



-20- 

agt caa gat ctt teg get gtt agt etc cct aaa caa aac gta cca att 1684 
Ser Gin Asp Leu Ser Ala Val Ser Leu Pro Lys Gin Asn Val Pro He 

540 545 550 

gtt ggt acg teg aga gag ggt caa aca aag caa gtt cct gtt gtt gat 1732 
Val Gly Thr Ser Arg Glu Gly Gin Thr Lys Gin Val Pro Val Val Asp 
555 560 565 

aga cag gat gca ttg tat gtg aat gga ctg gaa get aag gag gga gat 1780 
Arg Gin Asp Ala Leu Tyr Val Asn Gly Leu Glu Ala Lys Glu Gly Asp 
570 575 580 

cac aca tec gag aaa act gat gag gat geg ctt cat gta aag ttt aat 1828 
His Thr Ser Glu Lys Thr Asp Glu Asp Ala Leu His Val Lys Phe Asn 
585 590 595 600 

gtt gae aat gtg ttg egg aag cat cag gca gat aga acc caa gca gtg 1876 
Val Asp Asn Val Leu Arg Lys His Gin Ala Asp Arg Thr Gin Ala Val 

605 610 615 

gaa aag aaa act tgg aag aaa gtt gat gag gaa cat ctt tac atg act 192 4 
Glu Lys Lys Thr Trp Lys Lys Val Asp Glu Glu His Leu Tyr Met Thr 

620 625 630 

gaa cat cag aaa cgt get gcc gaa gga cag atg gta gtt aac gag gat 1972 
Glu His Gin Lys Arg Ala Ala Glu Gly Gin Met Val Val Asn Glu Asp 
635 640 645 

gag ctt tct ata act gaa att gga atg ggg aga ggt gat aaa att cag 2020 
Glu Leu Ser He Thr Glu He Gly Met Gly Arg Gly Asp Lys He Gin 
650 655 660 

cat gtg ctt tct gag gaa gag ctt tea tgg tct gaa gat gaa gtg cag 2068 
His Val Leu Ser Glu Glu Glu Leu Ser Trp Ser Glu Asp Glu Val Gin 
6^5 670 675 680 

tta att gag gat gat gga caa tat gaa gtt gac gag acc tct gtg tec 2116 
Leu He Glu Asp Asp Gly Gin Tyr Glu Val Asp Glu Thr Ser Val Ser 

685 690 695 

gtt aac gtt gaa caa gat ate cag ggg tea cca cag gat gtt gtg gat 2164 
Val Asn Val Glu Gin Asp He Gin Gly Ser Pro Gin Asp Val Val Asp 

700 705 710 

ccg caa gca eta aag gtg atg ctg caa gaa etc get gag aaa aat tat 2212 
Pro Gin Ala Leu Lys Val Met Leu Gin Glu Leu Ala Glu Lys Asn Tyr 
715 720 725 

teg atg agg aac aag ctg ttt gtt ttt cca cr^tg gta gtg aaa get gat 2260 
Ser Met Arg Asn Lys Leu Phe Val Phe Pro Glu Val Val Lys Ala Asp 
730 735 740 

tea gtt att gat ctt tat tta aat cgt gac eta aca get ttg gcg aat 2308 
Ser Val He Asp Leu Tyr Leu Asn Arg Asp Leu Thr Ala Leu Ala Asn 
■^45 750 • 755 



760 



gaa ccc gat gtc gtc ate aaa gga gca ttc aat ggt tgg aaa tgg agg 2356 

Glu Pro Asp Val Val He Lys Gly Ala Phe Asn Gly Trp Lys Trp Arg 

765 770 775 

ctt ttc act gaa aga ttg cac aag agt gae ctt gga ggg gtt tgg tgg 2404 

Leu Phe Thr Glu Arg Leu His Lys Ser Asp Leu Gly Gly Val Trp Trp 

780 785 790 

tct tgc aaa ctg tac ata ccc aag gag gee tac aga tta gae ttt gtg 2452 
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aat gtt tea aac agt gca acg gta egg gaa gtg gat gca agt gat gaa 916 
Asn Val Set Asn Ser Ala Thr Val Arg Glu Val Asp Ala Ser Asp Glu 

283 290 295 

get ggg aat gat caa ggc ata ttt aga gca gat ttg tea gga aat gtt 964 
Ala Gly Asn Asp Gin Gly lie Phe Arg Ala Asp Leu Ser Gly Asn Val 

300 305 310 

ttt tea age agt aca aca gtg gaa gtg ggt gea gtg gat gaa get ggg 1012 
Phe Ser Ser Ser Thr Thr Val Glu Val Gly Ala Val Asp Glu Ala Gly 
315 320 325 

tct ata aag gae agg ttt gag aeg gat teg tea gga aat gtt tea aca 1060 
Ser lie Lys Asp Arg Phe Glu Thr Asp Ser Ser Gly Asn Val Ser Thr 
330 335 340 

agt gcg ccg atg tgg gat gca att gat gaa acc gtg get gat caa gac 1108 
Ser Ala Pro Met Trp Asp Ala lie Asp Glu Thr Val Ala Asp Gin Asp 
345 . 350 355 360 

aca ttt gag gcg gat ttg teg gga aat get tea age tgc gca aca tac 1156 
Thr Phe Glu Ala Asp Leu Ser Gly Asn Ala Ser Ser Cys Ala Thr Tyr 

365 370 375 

aga gaa gtg gat gat gtg gtg gat gaa act aga tea gaa gag gaa aca 1204 
Arg Glu Val Asp Asp Val Val Asp Glu Thr Arg Ser Glu Glu Glu Thr 

380 385 390 

ttt gca atg gat ttg ttt gea agt gaa tea ggc eat gag aaa eat atg 1252 
Phe Ala Met Asp Leu Phe Ala Ser Glu Ser Gly His Glu Lys His Met 
395 400 405 

gea gtg gat tat gtg ggt gaa get acc gat gaa gaa gag act tac caa 1300 
Ala Val Asp Tyr Val Gly Glu Ala Thr Asp Glu Glu Glu Thr Tyr Gin 
410 415 420 

cag caa tat cca gta ccg tct tea ttc tct atg tgg gac aag get att 134 8 
Gin Gin Tyr Pro Val Pro Ser Ser Phe Ser Met Trp Asp Lys Ala lie 
425 430 435 440 

get aaa aca ggt gta agt ttg aat cct gag ctg cga ctt gtc agg gtt 1396 
Ala Lys Thr Gly Val Ser Leu Asn Pro Glu Leu Arg Leu Val Arg Val 

445 450 455 

gaa gaa caa ggc aaa gta aat ttt agt gat aaa aaa gae ctg tea att 1444 
Glu Glu Gin Gly Lys Val Asn Phe Ser Asp Lys Lys Asp Leu Ser He 

460 465 470 

gat gat tta cca gga caa aac caa teg ate att ggt tec tat aaa caa 1492 
Asp Asp Leu Pro Gly Gin Asn Gin Ser lie He Gly Ser Tyr Lys Gin 
475 480 485 

gat aaa tea att get gat gtt gcg gga ccg acc caa tea att ttt ggt 1540 
Asp Lys Ser He Ala Asp Val Ala Gly Pro Thr Gin Ser He Phe Gly 
490 495 500 

tct agt aaa caa cac egg tea att gtt get ttc cec aaa caa aac cag 1588 
Ser Ser Lys Gin His Arg Ser He Val Ala Phe Pro Lys Gin Asn Gin 
505 510 515 520 

tea att gtt agt gtc act gag caa aag cag tec ata gtt gga ttc cgt 1636 
Ser He Val Ser Val Thr Glu Gin Lys Gin Ser He Val Gly Phe Arg 

525 530 535 
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1050 1055 1060 

ctg ctt tec cag aaa cac att gtt tac acc gaa ccg ctt gaa ata cgt 3268 
Leu Leu Ser Gin Lys His He Val Tyr Thr Glu Pro Leu Glu He Arg 
1065 .1070 1075 1080 

gcc gga acc aca gtg gat gtg eta tac aat ccc tct aac aca gtg eta 3316 
Ala Gly Thr Thr Val Asp Val Leu Tyr Asn Pro Ser Asn Thr Val Leu 

1085 1090 1095 



aat gga aag teg gag ggt tgg ttt aga tgc tec ttt aac ctt tgg atg 
Asn Gly Lys Ser Glu Gly Trp Phe Arg Cys Ser Phe Asn Leu Trp Met 

1100 1105 . 1110 



3364 



cat tea agt ggg gca ttg cea ccc cag aag atg gtg aaa tea ggg gat 3412 
His Ser Ser Gly Ala Leu Pro Pro Gin Lys Met Val Lys Ser Glv Asd 
1115 1120 1125 

ggg ccg etc tta aaa gca aca gtt gat gtt cea ccg gat gee tat atg 34 60 
Gly Pro Leu Leu Lys Ala Thr Val Asp Val Pro Pro Asp Ala Tyr Met 
1130 1135 1140 

atg gac ttt gtt ttc tec gag tgg gaa gaa gat ggg ate tat gac aac 3508 
Met Asp Phe Val Phe Ser Glu Trp Glu Glu Asp Gly He Tyr Asp Asn 
1145 1150 1155 1160 

agg aat ggg atg gac tat eat att ect gtt tct gat tea att gaa aca 3556 
Arg Asn Gly Met Asp Tyr His He Pro Val Ser Asp Ser He Glu Thr 

1165 1170 1175 

gag aat tac atg cgt att ate cac att gcc gtt gag atg gcc ccc gtt 3604 
Glu Asn Tyr Met Arg He He His He Ala Val Glu Met Ala Pro Val 

1180 1185 1190 

gca aag gtt gga ggt ctt ggg gat gtt gtt aca agt ctt tea cgt gee 3652 
Ala Lys Val Gly Gly Leu Gly Asp Val Val Thr Ser Leu Ser Arg Ala 
1195 1200 1205 

att caa gat eta gga cat act gtc gag gtt att etc ccg aag tac gac 3700 
He Gin Asp Leu Gly His Thr Val Glu Val He Leu Pro Lys Tyr Asp 
1210 1215 1220 

tgt ttg aac caa age agt gtc aag gat tta eat tta tat. caa agt ttt 3748 
Cys Leu Asn Gin Ser Ser Val Lys Asp Leu His Leu Tyr Gin Ser Phe 
1225 1230 1235 1240 

tct tgg ggt ggt aca gaa ata aaa gta tgg gtt gga cga gtc gaa gac 37 96 
Ser Trp Gly Gly Thr Glu He Lys Val Trp Val Gly Arg Val Glu Asp 

1245 1250 1255 

ctg acc gtt tac ttc ctg gaa ect caa aat ggg atg ttt ggc gtt gga 3844 
Leu Thr Val Tyr Phe Leu Glu Pro Gin Asn Gly Met Phe Gly Val Gly 

1260 1265 1270 

tgt gta tat gga agg aat gat gac cgc aga ttt ggg ttc ttc tgt cat 3892 
Cys Val Tyr Gly Arg Asn Asp Asp Arg Arg Phe Gly Phe Phe Cys His 
1275 1280 1285 

tct get eta gag ttt ate etc cag aat gaa ttt tct cea eat ata ata 3940 
Ser Ala Leu Glu Phe He Leu Gin Asn Glu Phe Ser Pro His He He 
1290 1295 1300 

cat tgc eat gat tgg tea agt get ccg gtc gcc tgg eta tat aag gaa 3988 
His Cys His Asp Trp Ser Ser Ala Pro Val Ala Trp Leu Tyr Lys Glu 
1305 1310 1315 1320 
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Ser Cys Lys Leu Tyr lie Pro Lys Glu Ala Tyr Arg Leu Asp Phe Val 
795 800 805 

ttc ttc aac ggt cgc\acg gtc tat gag aac aat ggc aac aat gat ttc 2500 
Phe Phe Asn Gly Arg Thr Val Tyr Glu Asn Asn Gly Asn Asn Asp Phe 
810 815 820 

tgt ata gga ata gaa ggc act atg aat gaa gat ctg ttt gag gat ttc 2548 
Cys lie Gly lie Glu Gly Thr Met Asn Glu Asp Leu Phe Glu Asp Phe 
825 830 835 840 

ttg gtt aaa gaa aag caa agg gag ctt gag aaa ctt gcc atg gaa gaa 2596 
Leu Val Lys Glu Lys Gin Arg Glu Leu Glu Lys Leu Ala Met Glu Glu 

845 850 855 

get gaa agg agg aca cag act gaa gaa cag egg ega aga aag gaa gca 2 64 4 
Ala Glu Arg Arg Thr Gin Thr Glu Glu Gin Arg Arg Arg Lys Glu Ala 

860 865 870 

agg get gca gat gaa get gtc agg gca eaa geg aag gcc gag ata gag 2692 
Arg Ala Ala Asp Glu Ala Val Arg Ala Gin Ala Lys Ala Glu lie Glu 
875 880 885 

ate aag aag aaa aaa ttg eaa agt atg ttg agt ttg gcc aga aca tgt 2740 
lie Lys Lys Lys Lys Leu Gin Ser Met Leu Ser Leu Ala Arg Thr Cys 
890 895 900 

gtt gat aat ttg tgg tac ata gag get age aca gat aca aga gga gat 2788 
Val Asp Asn Leu Trp Tyr lie Glu Ala Ser Thr Asp Thr Arg Gly Asp 
905 910 915 920 

act ate agg tta tat tat aac aga aac teg agg eca ctt geg eat agt 2836 
Thr lie Arg Leu Tyr Tyr Asn Arg Asn Ser Arg Pro Leu Ala His Ser 

925 930 935 

act gag att tgg atg cat ggt ggt tac aac aat tgg aca gat gga etc 2884 
Thr Glu lie Trp Met His Gly Gly Tyr Asn Asn Trp Thr Asp Gly Leu 

940 945 . 950 

tct att gtt gaa age ttt gtc aag tgc aat gae aaa gac ggc gat tgg 2932 
Ser lie Val Glu Ser Phe Val Lys Cys Asn Asp Lys Asp Gly Asp Trp 



955 960 965 

tgg tat gca gat gtt att eca cct gaa aag gca ctt gtg ttg gac tgg 
Trp Tyr Ala Asp Val lie Pro Pro Glu Lys Ala Leu Val Leu Asp Trp 



970 975 980 



2980 



gtt ttt get gat ggg cea get ggg aat gca agg aac tat gac aac aat 3028 
Val Phe Ala Asp Gly Pro Ala Gly Asn Ala Arg Asn Tyr Asp Asn Asn 
985 990 995 1000 

get ega caa gat ttc cat get att ctt ccg aac aac aat gta ace gag 307 6 
Ala Arg Gin Asp Phe His Ala lie Leu Pro Asn Asn Asn Val Thr Glu 

1005 1010 1015 

gaa ggc ttc tgg geg caa gag gag caa aac ate tat aca agg ctt ctg 3124 
Glu Gly Phe Trp Ala Gin Glu Glu Gin Asn lie Tyr Thr Arg Leu Leu 

1020 1025 1030 

caa gaa agg aga gaa aag gaa gaa acc atg aaa aga aag get gag aga 3172 
Gin Glu Arg Arg Glu Lys Glu Glu Thr Met Lys Arg Lys Ala Glu Arg 
1035 1040 1045 

agt gca aat ate aaa get gag atg aag gca aaa act atg ega agg ttt 3220 
Ser Ala Asn lie Lys Ala Glu Met Lys Ala Lys Thr Met Arg Arg Phe 
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aat ggc gtg gat tat gcc etc aac aga gca ate ggc get tgg ttc gat 4804 
Asn Gly Val Asp Tyr Ala Leu Asn Arg Ala lie Gly Ala Trp Phe Asp 

1580 1585 1590 

gcc cgt gat tgg ttc cac tec ctg tgt aag agg gtc atg gag caa gac 4852 
Ala Arg Asp Trp- Phe His Ser Leu Cys Lys Arg Val Met Glu Gin Asp 
1595 1600 1605 

tgg teg tgg aac egg ecc gca ctg gac tac att gaa ttg tac eat gee 4 900 
Trp Ser Trp Asn Arg Pro Ala Leu Asp Tyr lie Glu Leu Tyr His Ala 
1610 1615 1620 

get cga aaa ttc tgacacccaa ctgaaccaat gacaagaaca agcgcattgt 4 952 

Ala Arg Lys Phe 

1625 

gggatcgact agteatacag ggetgtgcag atcgtcttgc ttcagttagt gccctcttca 5012 
gttagttcca agcgcactac agtegtacat agetgaggat eetcttgcct cctaccaggg 5072 
ggaacaaagc agaaatgcat gagtgeattg ggaagacttt tatgtatatt gttaaaaaaa 5132 
ttteettttc ttttccttce ctgcacctgg aaatggttaa gegeatcgcc gagataagaa 5192 
ccgcagtgac attctgtgag tagctttgta tattctctca tcttgtgaaa actaatgttc 5252 
atgttagget gtctgatcat gtggaagctt tgttatatgt taettatggt atacateaat 5312 
gatatttaea tttgtggaaa aaaaaaaaaa aaaa 534 6 

<210> 8 
<211> 1628 
<212> PRT 

<213> Triticum aestivum 
<400> 8 

Met Glu Met Ser Leu Trp Pro Arg Ser Pro Leu Cys Pro Arg Ser Arg 
15 10 15 

Gin Pro Leu Val Val Val Arg Pro Ala Gly Arg Gly Gly Leu Thr Gin 

20 25 30 

Pro Phe Leu. Met Asn Gly Arg Phe Thr Arg Ser Arg Thr Leu Arg Cys 
35 40 45 

Met Val Ala Ser Ser Asp Pro Pro Asn Arg Lys Ser Arg Arg Met Val 
50 55 60 

Pro Pro Gin Val Lys Val lie Ser Ser Arg Gly Tyr Thr Thr Arg Leu 
65 70 75 80 

lie Val Glu Pro Ser Asn Glu Asn Thr Glu His Asn Asn Arg Asp Glu 

85 90 95 

Glu Thr Leu Asp Thr Tyr Asn Ala Leu Leu Ser Thr Glu Thr Ala Glu 

100 105 110 

Trp Thr Asp Asn Arg Glu Ala Glu Thr Ala Lys Ala Asp Ser Ser Gin 
115 120 125 

Asn Ala Leu Ser Ser Ser lie lie Gly Glu Val Asp Val Ala Asp Glu 
130 135 140 

Asp lie Leu Ala Ala Asp Leu Thr Val Tyr Ser Leu Ser Ser Val Met 
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cac tat tec caa tec aga atg gca age act egg gtt gta ttt acc ate 4036 
His Tyr Ser Gin Ser Arg Met Ala Ser Thr Arg Val Val Phe Thr He 

1325 1330 1335 

cac aat ctt gaa ttt gga gca cat tat att ggt aaa gca atg aca tac 4084 
His Asn Leu Glu Phe Gly Ala His Tyr He Gly Lys Ala Met Thr Tyr 

1340 1345 1350 

tgt gat aaa gee aca act gtt tet ect aca tat tea agg gac gtg gca 4132 
Cys Asp Lys Ala Thr Thr Val Ser Pro Thr Tyr Ser Arg Asp Val Ala 
1355 1360 1365 

ggc eat ggc gee att get ect eat egt gag aaa ttc tac gge att etc 4180 
Gly His Gly Ala He Ala Pro His Arg Glu Lys Phe Tyr Gly He Leu 
1370 1375 1380 

aat gga att gat cca gat ate tgg gat ccg tac act gac aat ttt ate 4228 
Asn Gly He Asp Pro Asp He Trp Asp Pro Tyr Thr Asp Asn Phe He 
1385 1390 1395 1400 

ccg gtc ect tat act tgt gag aat gtt gtc gaa ggc aag aga get gca 4276 
Pro Val Pro Tyr Thr Cys Glu Asn Val Val Glu Gly Lys Arg Ala Ala 

1405 1410 1415 

aaa agg gee ttg eag eag aag ttt gga tta cag caa act gat gtc ect 4 324 
Lys Arg Ala Leu Gin Gin Lys Phe Gly Leu Gin Gin Thr Asp Val Pro 

1420 1425 1430 

att gtc gga ate ate ace egt etg- aca gee eag aag gga ate cac etc 4372 
He Val Gly He He Thr Arg Leu Thr Ala Gin Lys Gly He His Leu 
1435 1440 1445 

ate aag cac gca att cac cga act etc gaa age aae gga cat gtg gtt 4420 
lie Lys His Ala He His Arg Thr Leu Glu Ser Asn Gly His Val Val 
1450 1455 1460 

ttg ctt ggt tea get cca gat cat cga ata caa ggc gat ttt tgc aga 

Leu Leu 
14 65 



ggt tea get cca gat cat cg^ dud t^aa yy- yau ui-u uy^- 
Gly Ser Ala Pro Asp His Arg He Gin Gly Asp Phe Cys Arg 

1470 1475 1480 



tet ctt ggt ctt gaa cca aat ggg ttc agt ttc gac gga gee gac age 4756 
Ser Leu Gly Leu Glu Pro Asn Gly Phe Ser Phe Asp Gly Ala Asp Ser 

1565 1570 1575 



4468 



ttg gee gat get ctt cat ggt gtt tac cat ggt agg gtg aag ctt gtt 4516 
Leu Ala Asp Ala Leu His Gly Val Tyr His Gly Arg Val Lys Leu Val 

1485 1490 1495 

eta ace tat gat gag ect ctt tet cac etg ata tac get ggc teg gac 4564 
Leu Thr Tyr Asp Glu Pro Leu Ser His Leu He Tyr Ala Gly Ser Asp 

1500 1505 1510 

ttc ata att gtt ect tea ate ttc gaa cec tgt gge tta aca caa ctt 4 612 
Phe He He Val Pro Ser He Phe Glu Pro Cys Gly Leu Thr Gin Leu 
1515 1520 1525 

gtt gee atg egt tat gga teg ate ect ata gtt egg aaa act gga gga 4660 
Val Ala Met Arg Tyr Gly Ser He Pro He Val Arg Lys Thr Gly Gly 
1530 1535 1540 

ctt cac gac aca gtc ttc gac gta gac aat gat aag gac egg get egg 4'708 
Leu His Asp Thr Val Phe Asp Val Asp Asn Asp Lys Asp Arg Ala Arg 
1545 1550 1555 1560 
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Gly Pro Thr Gin Ser lie Phe Gly Ser Ser Lys Gin His Arg Ser He 

500 305 510 

Val Ala Phe Pro Lys Gin Asn Gin Ser He Val Ser Val Thr Giu Gin 
515 520 525 

Lys Gin Ser He Val Gly Phe Arg Ser Gin Asp Leu Ser Ala Val Ser 
530 533 540 

Leu Pro Lys Gin Asn Val Pro He Val Gly Thr Ser Arg Glu Gly Gin 
545 530 355 560 

Thr Lys Gin Val Pro Val Val Asp Arg Gin Asp Ala Leu Tyr Val Asn 

565 570 375 

Gly Leu Giu Ala Lys Glu Gly Asp His Thr Ser Glu Lys Thr Asp Glu 

580 583 590 

Asp Ala Leu His Val Lys Phe Asn Val Asp Asn Val Leu Arg Lys His 
595 600 605 

Gin Ala Asp Arg Thr Gin Ala Val Glu Lys Lys Thr Trp Lys Lys Val 
610 615 620 

Asp Giu Giu His Leu Tyr Met Thr Giu His Gin Lys Arg Ala Ala Giu 
625 ' 630 635 640 

Gly Gin Met Val Val Asn Glu Asp Glu Leu Ser He Thr Giu He Gly 

645 650 655 

Met Gly Arg Gly Asp Lys lie Gin His Val Leu Ser Glu Glu Glu Leu 

660 665 670 

Ser Trp Ser Giu Asp Glu Val Gin Leu He Glu Asp Asp Gly Gin Tyr 
675 680 685 

Glu Val Asp Giu Thr Ser Val Ser Val Asn Val Glu Gin Asp lie Gin 
690 695 700 

Gly Ser Pro Gin Asp Val Val Asp Pro Gin Ala Leu Lys Val Met Leu 
705 710 715 720 

Gin Giu Leu Ala Giu Lys Asn Tyr Ser Met Arg Asn Lys Leu Phe Val 

725 730 735 

Phe Pro Giu Val Val Lys Ala Asp Ser Val He Asp Leu Tyr Leu Asn 

740 745 750 

Arg Asp Leu Thr Ala Leu Ala Asn Giu Pro Asp Val Val He Lys Gly 
755 760 765 

Ala Phe Asn Gly Trp Lys Trp Arg Leu Phe Thr Glu Arg Leu His Lys 
770 775 780 

Ser Asp Leu Gly Gly Val Trp Trp Ser Cys Lys Leu Tyr He Pro Lys 
785 790 795 800 

Giu Ala Tyr Arg Leu Asp Phe Val Phe Phe Asn Gly Arg Thr Val Tyr 

805 810 815 

Giu Asn Asn Gly Asn Asn Asp Phe Cys lie Gly He Giu Gly Thr Met 

820 825 830 



Asn Giu Asp Leu Phe Glu Asp Phe Leu Val Lys Glu Lys Gin Arg Glu 
835 840 845 
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145 150 155 160 

Lys Lys Glu Val Asp Ala Ala Asp Lys Ala Arg Val Lys Glu Asp Ala 

165 170 175 

Phe Glu Leu Asp Leu Pro Ala Thr Thr Leu Arg Ser Val lie Val Asp 

180 185 190 

Val Met Asp His Asn Gly Thr Val Gin Glu Thr Leu Arg Ser Val lie 
195 200 205 

Val Asp Val Met Asp His Asn Gly Thr Val Gin Glu Thr Leu Arg Ser 
210 215 220 

Val lie Val Asp Val Met Asp Asp Ala Ala Asp Lys Ala Arg Val Glu 
225 230 235 240 

Glu Asp Val Phe Glu Leu Asp Leu Ser Gly Asn lie Ser Ser Ser Ala 

245 250 255 

Thr Thr Val Glu Leu Asp Ala Val Asp Glu Val Gly Pro Val Gin Asp 

260 265 270 

Lys Phe Glu Ala Thr Ser Ser Gly Asn Val Ser Asn Ser Ala Thr Val 
275 280 285 

Arg Glu Val Asp Ala Ser Asp Glu Ala Gly Asri Asp Gin Gly lie Phe 
290 295 300 

Arg Ala Asp Leu Ser Gly Asn Val Phe Ser Ser Ser Thr Thr Val Glu 
305 310 315 320 

Val Gly Ala Val Asp Glu Ala Gly Ser lie Lys Asp Arg Phe Glu Thr 

325 330 335 

Asp Ser Ser Gly Asn Val Ser Thr Ser Ala Pro Met Trp Asp Ala lie 

340 345 350 

Asp Glu Thr Val Ala Asp Gin Asp Thr Phe Glu Ala Asp Leu Ser Gly 
355 360 365 

Asn Ala Ser Ser Cys Ala Thr Tyr Arg Glu Val Asp Asp Val Val Asp 
370 375 380 

Glu Thr Arg Ser Glu Glu Glu Thr Phe Ala Met Asp Leu Phe Ala Ser 
385 390 395 400 

Glu Ser Gly His Glu Lys His Met Ala Val Asp Tyr Val Gly Glu Ala 

405 410 415 

Thr Asp Glu Glu Glu Thr Tyr Gin Gin Gin Tyr Pro Val Pro Ser Ser 

420 425 430 

Phe Ser Met Trp Asp Lys Ala lie Ala Lys Thr Gly Val Ser Leu Asn 
435 440 445 

Pro Glu Leu Arg Leu Val Arg Val Glu Glu Gin Gly Lys Val Asn Phe 
450 455 460 

Ser Asp Lys Lys Asp Leu Ser lie Asp Asp Leu Pro Gly Gin Asn Gin 
465 470 475 • 480 



Ser lie lie Gly Ser Tyr Lys Gin Asp Lys Ser lie Ala Asp Val Ala 

485 490 495 
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185 1190 1195 1200 

Val Val Thr Ser Leu Ser Arg Ala lie Gin Asp Leu Gly His Thr Val 

1205 1210 1215 

Glu Val lie Leu Pro Lys Tyr Asp Cys Leu Asn Gin Ser Ser Val Lys 

1220 1225 1230 

Asp Leu His Leu Tyr Gin Ser Phe Ser Trp Gly Gly Thr Glu He Lys 
1235 1240 1245 

Val Trp Val Gly Arg Val Glu Asp Leu Thr Val Tyr Phe Leu Glu Pro 
1250 1255 1260 

Gin Asn Gly Met Phe Gly Val Gly Cys Val Tyr Gly Arg Asn Asp Asp 
265 1270 1275 1280 

Arg Arg Phe Gly Phe Phe Cys His Ser Ala Leu Glu Phe He Leu Gin 

1285 1290 1295 

Asn Glu Phe Ser Pro His He lie His Cys His Asp Trp Ser Ser Ala 

1300 1305 1310 

Pro Val Ala Trp Leu Tyr Lys Glu His Tyr Ser Gin Ser Arg Met Ala 
1315 1320 1325 

Ser Thr Arg Val Val Phe Thr He His Asn Leu Glu Phe Gly Ala His 
1330 1335 1340 

Tyr He Gly Lys Ala Met Thr Tyr Cys Asp Lys Ala Thr Thr Val Ser 
345 1350 1355 1360 

Pro Thr Tyr Ser Arg Asp Val Ala Gly His Gly Ala He Ala Pro His 

1365 1370 1375 

Arg Glu Lys Phe Tyr Gly He Leu Asn Gly He Asp Pro Asp He Trp 

1380 1385 1390 

Asp Pro Tyr Thr Asp Asn Phe He Pro Val Pro Tyr Thr Cys Glu Asn 
1395 1400 1405 

Val Val Glu Gly Lys Arg Ala Ala Lys Arg Ala Leu Gin Gin Lys Phe 
1410 1415 1420 

Gly Leu Gin Gin Thr Asp Val Pro He Val Gly He He Thr Arg Leu 
425 1430 1435 1440 

Thr Ala Gin Lys Gly He His Leu He Lys His Ala He His Arg Thr 

1445 1450 1455 

Leu Glu Ser Asn Gly His Val Val Leu Leu Gly Ser Ala Pro Asp His 

1460 1465 1470 

Arg He Gin Gly Asp Phe Cys Arg Leu Ala Asp Ala Leu His Gly Val 
1475 1480 1485 

Tyr His Gly Arg Val Lys Leu Val Leu Thr Tyr Asp Glu Pro Leu Ser 
1490 1495 1500 

His Leu He Tyr Ala Gly Ser Asp Phe He He Val Pro Ser He Phe 
505 1510 1515 1520 



Glu Pro Cys Gly Leu Thr Gin Leu Val Ala Met Arg Tyr Gly Ser He 

1525 1530 1535 
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Leu Glu Lys Leu Ala Met Glu Glu Ala Glu Arg Arg Thr Gin Thr Glu 
850 855 860 

Glu Gin Arg Arg Arg Lys Glu Ala Arg Ala Ala Asp Glu Ala Vai Arg 
865 870 875 880 

Ala Gin Ala Lys Ala Glu lie Glu lie Lys Lys Lys Lys Leu Gin Ser 

885 890 895 

Met Leu Ser Leu Ala Arg Thr Cys Val Asp Asn Leu Trp Tyr lie Glu 

900 905 910 

Ala Ser Thr Asp Thr Arg Gly Asp Thr lie Arg Leu Tyr Tyr Asn Arg 
915 920 925 

Asn Ser Arg Pro Leu Ala His Ser Thr Glu lie Trp Met His Gly Gly 
930 935 940 

Tyr Asn Asn Trp Thr Asp Gly Leu Ser lie Val Glu Ser Phe Val Lys 
945 950 955 960 

Cys Asn Asp Lys Asp Gly Asp Trp Trp Tyr Ala Asp Val lie Pro Pro 

965 970 975 

Glu Lys Ala Leu Val Leu Asp Trp Val Phe Ala Asp Gly Pro Ala Gly 

980 985' 990 

Asn Ala Arg Asn Tyr Asp Asn Asn Ala Arg Gin Asp Phe His Ala lie 
995 1000 1005 

Leu Pro Asn Asn Asn Val Thr Glu Glu Gly Phe Trp Ala Gin Glu Glu 
1010 1015 1020 

Gin Asn lie Tyr Thr Arg Leu Leu Gin Glu Arg Arg Glu Lys Glu Glu 
025 1030 1035 1040 

Thr Met Lys Arg Lys Ala Glu Arg Ser Ala Asn lie Lys Ala Glu Met 

1045 ■ ■• 1050 1055 

Lys Ala Lys Thr Met Arg Arg Phe Leu Leu Ser Gin Lys His He Val 

1060 1065 1070 

Tyr Thr Glu Pro Leu Glu He Arg Ala Gly Thr Thr Val Asp Val Leu 
1075 1080 1085 

Tyr Asn Pro Ser Asn Thr Val Leu Asn Gly Lys Ser Glu Gly Trp Phe 
1090 1095 1100 

Arg Cys Ser Phe Asn Leu Trp Met His Ser Ser Gly Ala Leu Pro Pro 
105 1110 1115 1120 

Gin Lys Met Val Lys Ser Gly Asp Gly Pro Leu Leu Lys Ala Thr Val 

1125 1130 1135 

Asp Val Pro Pro Asp Ala Tyr Met Met Asp Phe Val Phe Ser Glu Trp 

1140 1145 1150 

Glu Glu Asp Gly He Tyr Asp Asn Arg Asn Gly Met Asp Tyr His He 
1155 1160 1165 

Pro Val Ser Asp Ser He Glu Thr Glu Asn Tyr Met Arg He He His 
1170 1175 1180 



He Ala Val Glu Met Ala Pro Val Ala Lys Val Gly Gly Leu Gly Asp 
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gca eta aag gtg atg ctg caa gaa etc get gag aaa aat tat teg atg 480 
Ala Leu Lys Val Met Leu Gin Glu Leu Ala Glu Lys Asn Tyr Ser Met 

150 155 160 

agg aac aag ctg ttt gtt ttt cca gag gta gtg aaa get gat tea gtt 528 
Arg Asn Lys Leu Phe Val Phe Pro Glu Val Val Lys Ala Asp Ser Val 

165 170 175 

att gat ctt tat tta aat cgt gac eta aca get ttg gcg aat gaa eec 576 
He Asp Leu Tyr Leu Asn Arg Asp Leu Thr Ala Leu Ala Asn Glu Pro 

180 185 190 

gat gtc gtc ate aaa gga gca ttc aat ggt tgg aaa tgg agg ctt ttc 624 
Asp Val Val He Lys Gly Ala Phe Asn Gly Trp Lys Trp Arg Leu Phe 
195 200 205 

act gaa aga ttg cac aag agt gac ctt gga ggg gtt tgg tgg tct tgc 672 
Thr Glu Arg Leu His Lys Ser Asp Leu Gly Gly Val Trp Trp Ser Cys 
210 215 220 

aaa ctg tac ata ccc aag gag gee tac aga tta gac ttt gtg ttc ttc 720 
Lys Leu Tyr He Pro Lys Glu Ala Tyr Arg Leu Asp Phe Val Phe Phe 
225 230 . 235 240 

aac ggt cgc acg gtc tat gag aac aat ggc aac aat gat ttc tgt ata 768 
Asn Gly Arg Thr Val Tyr Glu Asn Asn Gly Asn Asn Asp Phe Cys He 

245 250 255 

gga ata gaa ggc act atg aat gaa gat ctg ttt gag gat ttc ttg gtt 816 
Gly He Glu Gly Thr Met Asn Glu Asp Leu Phe Glu Asp Phe Leu Val 

260 265 270 

aaa gaa aag caa agg gag ctt gag aaa ctt gee atg gaa gaa get gaa 864 
Lys Glu Lys Gin Arg Glu Leu Glu Lys Leu Ala Met Glu *Glu Ala Glu 
275 280 285 

agg agg aca eag act gaa gaa cag egg cga aga aag gaa gca agg get 912 
Arg Arg Thr Gin Thr Glu Glu Gin Arg Arg Arg Lys Glu Ala Arg Ala 
290 295 300 

gca gat gaa get gtc agg gca caa gcg aag gee gag ata gag ate aag 9 60 
Ala Asp Glu Ala Val Arg Ala Gin Ala Lys Ala Glu He Glu He Lys 
305 310 315 320 

aag aaa aaa ttg caa agt atg ttg agt ttg gcc aga aca tgt gtt gat 1008 
Lys Lys Lys Leu Gin Ser Met Leu Ser Leu Ala Arg Thr Cys Val Asp 

325 330 335 

aat ttg tgg tac ata gag. get age aca gat aca aga gga gat act ate 1056 
Asn Leu Trp Tyr He Glu Ala Ser Thr Asp Thr Arg Gly Asp Thr He 

340 345 350 

agg tta tat tat aac aga aac teg agg cca ctt gcg cat agt act gag 1104 
Arg Leu Tyr Tyr Asn Arg Asn Ser Arg Pro Leu Ala His Ser Thr Glu 
355 360 365 

att tgg atg cat ggt ggt tac aac aat tgg tea gat gga etc tct att 1152 
He Trp Met His Gly Gly Tyr Asn Asn Trp Ser Asp Gly Leu Ser He 
370 375 380 

gtt gaa age ttt gtc aag tgc aat gac aaa gac ggc gat tgg tgg tat 1200 
Val Glu Ser Phe Val Lys Cys Asn Asp Lys Asp Gly Asp Trp Trp Tyr 
385 390 395 400 

gca gat gtt att cca cet gaa aag gca ctt gtg ttg gac tgg gtt ttt 1248 
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Pro lie Val Arg Lys Thr Gly Gly Leu His Asp Thr Val Phe Asp Val 

1540 1545 1550 

Asp Asn Asp Lys Asp Arg Ala Arg Ser Leu Gly Leu Glu Pro Asn Gly 
1555 1560 1565 

Phe Ser Phe Asp Gly Ala Asp Ser Asn Gly Val Asp Tyr Ala Leu Asn 
1570 1575 1580 

Arg Ala He Gly Ala Trp Phe Asp Ala Arg Asp Trp Phe His Ser Leu 
585 1590 1595 1600 

Cys Lys Arg Val Met Glu Gin Asp Trp Ser Trp Asn Arg Pro Ala Leu 

1605 1610 1615 

Asp Tyr He Glu Leu Tyr His Ala Ala Arg Lys Phe 

1620 1625 



<210> 9 
<211> 3621 
<212> DMA 

<213> Triticum aestivum 

<220> 

<221> CDS 

<222> (1) . . (3177) 

<400> 9 

gat gca ttg tat gtg aat gga ctg gaa get aag gag gga gat cac aca 48 

Asp Ala Leu Tyr Val Asn Gly Leu Glu Ala Lys Glu Gly Asp His Thr 
15 10 15 

tec gag aaa act gat gag gat gcg ctt cat gta aag ttt aat gtt gac 96 
Ser Glu Lys Thr Asp Glu Asp Ala Leu His Val Lys Phe Asn Val Asp 

20 25 30 

aat gtg ttg egg aag cat cag gca gat aga acc caa gca gtg gaa aag 144 
Asn Val Leu Arg Lys His Gin Ala Asp Arg Thr Gin Ala Val Glu Lys 
35 40 45 

aaa act tgg aag aaa gtt gat gag gaa cat ctt tac atg act gaa cat 192 
Lys Thr Trp Lys Lys Val Asp Glu Glu His Leu Tyr Met Thr Glu His 
50 55 60 

cag aaa cgt get gee gaa gga eag atg gta gtt aae gag gat gag ctt 24 0 
Gin Lys Arg Ala Ala Glu Gly Gin Met Val Val Asn Glu Asp Glu Leu 
65 70 75 80 

tet ata act gaa att gga atg ggg aga ggt gat aaa att eag eat gtg 288 
Ser He Thr Glu He Gly Met Gly Arg Gly Asp Lys He Gin His Val 

85 90 95 

ctt tct gag gaa gag ctt tea tgg tct gaa gat gaa gtg cag tta att 336 
Leu Ser Glu Glu Glu Leu Ser Trp Ser Glu Asp Glu Val Gin Leu He 

100 105 110 

gag gat gat gga caa tat gaa gtt gac gag acc tct gtg tec gtt aae 384 
Glu Asp Asp Gly Gin Tyr Glu Val Asp Glu Thr Ser Val Ser Val Asn 
115 120 125 



gtt gaa caa gat ate eag ggg tea eca eag gat gtt gtg gat ecg caa 432 
Val Glu Gin Asp He Gin Gly Ser Pro Gin Asp Val Val Asp Pro Gin 
130 135 140 
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660 665 670 

tgg ggt ggt aca gaa ata aaa gta tgg gtt gga cga gtc gaa gac ctg 2064 
Trp Gly Gly Thr Glu lie Lys Val Trp Vai Gly Arg Val Glu Asp Leu 
673 680 685 

acc gtt tac ttc ctg gaa cct caa aat ggg atg ttt ggc gtt gga tgt 2112 
Thr Val Tyr Phe Leu Glu Pro Gin Asn Gly Met Phe Gly Val Gly Cys 
690 695 700 

gta tat gga agg aat gat gac cgc aga ttt ggg ttc ttc tgt cat tct 2160 
Val Tyr Gly Arg Asn Asp Asp Arg Arg Phe Gly Phe Phe Cys His Ser 
705 710 715 720 

get eta gag ttt ate etc cag aat gaa ttt tct cca cat ata ata cat 2208 
Ala Leu Glu Phe lie Leu Gin Asn Glu Phe Ser Pro His lie lie His 

725 730 735 

tgc cat gat tgg tea agt get ccg gtc gee tgg eta tat aag gaa cac 2256 
Cys His Asp Trp Ser Ser Ala Pro Val Ala Trp Leu Tyr Lys Glu His 

740 745 750 

tat tec caa tec aga atg gca age act egg gtt gta ttt ace ate cac 2304 
Tyr Ser Gin Ser Arg Met Ala Ser Thr Arg Val Val Phe Thr lie His 
755 760 765 

aat ett gaa ttt gga gca cat tat att ggt aaa gca atg aca tac tgt 2352 
Asn Leu Glu Phe Gly Ala His Tyr lie Gly Lys Ala Met Thr Tyr Cys 
770 775 780 

gat aaa gee aca act gtt tct cct aca tat tea agg gac gtg gca ggc 2400 
Asp Lys Ala Thr Thr Val Ser Pro Thr Tyr Ser Arg Asp Val Ala Gly 
785 790 795 800 

eat ggc gcc att get cct cat cgt gag aaa ttc tac ggc att etc aat 2448 
His Gly Ala He Ala Pro His Arg Glu Lys Phe Tyr Gly He Leu Asn 

805 810 815 

gga att gat cca gat ate tgg gat ccg tac act gac aat ttt ate ccg 2496 
Gly He Asp Pro Asp He Trp Asp Pro Tyr Thr Asp Asn Phe He Pro 

820 825 830 

gtc cct tat act tgt gag aat gtt gtc gaa ggc aag agg get gca aaa 2544 
Val Pro Tyr Thr Cys Glu Asn Val Val Glu Gly Lys Arg Ala Ala Lys 
835 840 845 

agg gee ttg cag cag aag ttt gga tta cag caa act gat gtc cct att 2592 
Arg Ala Leu Gin Gin Lys Phe Gly Leu Gin Gin Thr Asp Val Pro He 
850 855 860 

gtc gga ate ate ace cgt ctg aca gca cag aag gga ate cac etc ate 2 640 
Val Gly He He Thr Arg Leu Thr Ala Gin Lys Gly He His Leu He 
865 870 875 880 



aag cac gca att cac cga acc etc gag age aat gga caa gtg gtt ttg 
Lys His Ala He His Arg Thr Leu Glu Ser Asn Gly Gin Val Val Leu 



885 890 895 



2688 



ett ggt tea get cca gat cat cga ata caa ggc gat ttt tgc aga ttg 2736 
Leu Gly Ser Ala Pro Asp His Arg He Gin Gly Asp Phe Cys Arg Leu 

900 905 910 



gee gat get ett cac ggt gtt tac cat ggt agg gtg aag ett gtt eta 
Ala Asp Ala Leu His Gly Val Tyr His Gly Arg Val Lys Leu Val Leu 
915 920 925 



2784 
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Ala Asp Val lie Pro Pro Glu Lys Ala Leu Val Leu Asp Trp Vai Phe 

405 410 415 • 

get gat ggg cca get ggg aat gca agg aac tat gac aac aat get cga 1296 
Ala Asp Gly Pro Ala Gly Asn Ala Arg Asn Tyr Asp Asn Asn Ala Arg 

420 425 430 

caa gat ttc cat get att ctt ccg aac aac aat gta acc gag gaa ggc 134 4 
Gin Asp Phe His Ala lie Leu Pro Asn Asn Asn Val Thr Glu Glu Gly 
435 440 445 

ttc tgg gcg caa gag gag caa aac ate tat aca agg ctt ctg eaa gaa 1392 
Phe Trp Ala Gin Glu Glu Gin Asn lie Tyr Thr Arg Leu Leu Gin Glu 
450 455 460 

agg aga gaa aag gaa gaa acc atg aaa aga aag get gag aga agt gca 1440 
Arg Arg Glu Lys Glu Glu Thr Met Lys Arg Lys Ala Glu Arg Ser Ala 
465 470 475 480 

aat ate aaa get gag atg aag gca aaa act atg cga agg ttt ctg ctt 1488 
Asn lie Lys Ala Glu Met Lys Ala Lys Thr Met Arg Arg Phe Leu Leu 

485 490 495 

tec cag aaa cac att gtt tat acc cga acc gnc ttg aaa tac gtg ccc 1536 
Ser Gin Lys His lie Val Tyr Thr Arg Thr Xaa Leu Lys Tyr Val Pro 

500 505 510 

gga ace aca gtg gat gtg eta tac aat ccc tct aac aca gtg eta aat 1584 
Gly Thr Thr Val Asp Val Leu Tyr Asn Pro Ser Asn Thr Val Leu Asn 

515 520 525 



gga aag teg gag ggt tgg ttt aga tgc tee ttt aac ett tgg atg eat 
Gly Lys Ser Glu Gly Trp Phe Arg Cys Ser Phe Asn Leu Trp Met His 



530 • 535 540 



eaa gat eta gga eat act gte gag gtt att etc ccg aag tac gac tgt 
Gin Asp Leu Gly His Thr Val Glu Val He Leu Pro Lys Tyr Asp Cys 



1632 



tea agt ggg gca ttg eca ecc cag aag atg gtg aaa tea ggg gat ggg 1680 
Ser Ser Gly Ala Leu Pro Pro Gin Lys Met Val Lys Ser Gly Asp Gly 
545 550 555 560 

eeg etc tta aaa gca aca gtt gat gtt eca ccg gat gee tat atg atg 1728 
Pro Leu Leu Lys Ala Thr Val Asp Val Pro Pro Asp Ala Tyr Met Met 

565 570 575 

gac ttt gtt ttc tec gag tgg gaa gaa gat ggg ate tat gac aac agg 1776 
Asp Phe Val Phe Ser Glu Trp Glu Glu Asp Gly He Tyr Asp Asn Arg 

580 585 590 

aat ggg atg gac tat eat att cet gtt tct gat tea att gaa aca gag 1824 
Asn Gly Met Asp Tyr His He Pro Val Ser Asp Ser He Glu Thr Glu 
595 , 600 605 

aat tac atg cgt att ate eae att gee gtt gag atg gee ccc gtt gca 1872 
Asn Tyr Met Arg He He His He Ala Val Glu Met Ala Pro Val Ala 
610 615 620 

aag gtt gga ggt ctt ggg gat gtt gtt aca agt ett tea cgt gee att 1920 
Lys Val Gly Gly Leu Gly Asp Val Val Thr Ser Leu Ser Arg Ala. He 
625 630 635 640 



1968 



645 650 655 



ttg aac caa age agt gtc aag gat tta cat tta tat caa agt ttt tct 
Leu Asn Gin Ser Ser Val Lys Asp Leu His Leu Tyr Gin Ser Phe Ser 



2016 
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Lys Thr Trp hys Lys Val Asp Glu Glu His Leu Tyr Met Thr Glu His 
50 55 60 

Gin Lys Arg Ala Ala Glu Gly Gin Met Val Val Asn Glu Asp Glu Leu 

^0 75 80 

Ser lie Thr Glu He Gly Met Gly Arg Gly Asp Lys He Gin His Val 

85 90 95 

Leu Ser Glu Glu Glu Leu Ser Trp Ser Glu Asp Glu Val Gin Leu He 

100 105 xiO 

Glu Asp Asp Gly Gin Tyr Glu Val Asp Glu Thr Ser Val Ser Val Asn 
115 120 125 

Val Glu Gin Asp He Gin Gly Ser Pro Gin Asp Val Val Asp Pro Gin 
130 135 140 

Ala Leu Lys Val Met Leu Gin Glu Leu Ala Glu Lys Asn Tyr Ser Met 

150 155 160 

Arg Asn Lys Leu Phe Val Phe Pro Glu Val Val Lys Ala Asp Ser Val 

165 170 175 

He Asp Leu Tyr Leu Asn Arg Asp Leu Thr Ala Leu Ala Asn Glu Pro 

180 185 190 

Asp Val Val He Lys Gly Ala Phe Asn Gly Trp Lys Trp Arg Leu Phe 
195 200 205 

Thr Glu Arg Leu His Lys Ser Asp Leu Gly Gly Val Trp Trp Ser Cvs ^ 
210 215 220 

Lys Leu Tyr He Pro Lys Glu Ala Tyr Arg Leu Asp Phe Val Phe Phe 
225 230 2.35 240 

Asn Gly Arg Thr Val Tyr Glu Asn Asn Gly Asn Asn Asp Phe Cys He 

245 250 255 

Gly He Glu Gly Thr Met Asn Glu Asp Leu Phe Glu Asp Phe Leu Val 

260 265 270 

Lys Glu Lys Gin Arg Glu Leu Glu Lys Leu Ala Met Glu Glu Ala Glu 
275 280 285 

Arg Arg Thr Gin Thr Glu Glu Gin Arg Arg Arg Lys Glu Ala Arg Ala 
290 295 300 

Ala Asp Glu Ala Val Arg Ala Gin Ala Lys Ala Glu He Glu He Lys 
^05 310 315 320 

Lys Lys Lys Leu Gin Ser Met Leu Ser Leu Ala Arg Thr Cys Val Asp 

325 330 335 

Asn Leu Trp Tyr He Glu Ala Ser Thr Asp Thr Arg Gly Asp Thr He 

340 345 350 

Arg Leu Tyr Tyr Asn Arg Asn Ser Arg Pro Leu Ala His Ser Thr Glu 
355 360 365 

He Trp Met His Gly Gly Tyr Asn Asn Trp Ser Asp Gly Leu Ser He 
370 375 380 

Val Glu Ser Phe Val Lys Cys Asn Asp Lys Asp Gly Asp Trp Trp Tyr 
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acc tac gat gag cct ctt tct cac ctg ata tac get ggc tec gac ttc 2832 
Thr Tyr Asp Glu Pro Leu Ser His Leu He Tyr Ala Gly Ser Asp Phe 
930 935 ■ 940 

att att gtc cct tea ate ttt gaa cec tgt ggc tta aca caa ctt gtt 2880 
He He Val Pro Ser He Phe Glu Pro Cys Gly Leu Thr Gin Leu Val 
945 950 955 960 

gcc atg cgt tat gga teg ate cet ata gtt egg aaa ace gga gga ett 2928 
Ala Met Arg Tyr Gly Ser He Pro He Val Arg Lys Thr Gly Gly Leu 

965 970 975 

tac gac act gtc ttc gac gta gac aat gat aag gac egg get egg tet 2976 
Tyr Asp Thr Val Phe Asp Val Asp Asn Asp Lys Asp Arg Ala Arg Ser 

980 985 990 

ett ggt ctt gaa cca aat ggg ttc agt ttc gac gga gee gac age aat 3024 
Leu Gly Leu Glu Pro Asn Gly Phe Ser Phe Asp Gly Ala Asp Ser Asn 
995 1000 1005 

ggc gtg gat tat gcc etc aac aga gca ate ggc get tgg tte gat gee 3072 
Gly Val Asp Tyr Ala Leu Asn Arg Ala He Gly Ala Trp Phe Asp Ala 
1010 1015 1020 

egt gat tgg ttc cae tec ctg tgt aag agg gtc atg gag caa gac tgg 3120 
Arg Asp Trp Phe His Ser Leu Cys Lys Arg Val Met Glu Gin Asp Trp 
1025 1030 1035 1040 

teg tgg aac egg cct gea ctg gac tac att gaa ttg tac eat gee get 3168 
Ser Trp Asn Arg Pro Ala Leu Asp Tyr He Glu Leu Tyr His Ala Ala 

1045 1050 1055 



cga aaa ttc tgacacecaa ctgaaccaat ggeaagaaca agcgeattgt 3217 
Arg Lys Phe 



gggategaet 


acagtcatae 


agggetgtgc 


agatcgtctt 


gettcagtta 


gtgecctett 


3277 


cagttagttc 


eaagegeact 


acagtcgtae 


atagetgagg 


atectettgc 


etceteeace 


3337 


ag^^gcLaaea 


aagcagaaat 


gcataagtge 


attgggaaga 


cttttatgta 


tattgttaaa 


3397 


tttttccttt 


tetttteett 


ccetgeaect 


ggaaatggtt 


aagcgcateg 


cegagataag 


3457 


aaccaeagta 


aeattetgtg 


agtagetttg 


tatattctct 


catcttgtga 


aaaetaatgt 


3517 


gcatgttagg 


ctetetgatc 


atgtggaagc 


tttgttatat 


gttacttatg 


gttatatggt 


3577 


atacateaat 


gatat ttaca 


tttgtggaaa 


aaaaaaar:aa 


aaaa 




3621 



<210> 10 
<211> 1059 
<212> PRT 

<213> Tritieum aestivum 
<400> 10 

Asp Ala Leu Tyr Val Asn Gly Leu Glu Ala Lys Glu Gly Asp His Thr 
15 10 15 

Ser Glu Lys Thr Asp Glu Asp Ala Leu His Val Lys Phe Asn Val Asp 

20 25 30 



Asn Val Leu Arg Lys His Gin Ala Asp Arg Thr Gin Ala Val Glu Lys 
35 40 45 
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Cys His Asp Trp Ser Ser Ala Pro Val Ala Trp Leu Tyr Lys Glu His 

740 745 750 

Tyr Ser Gin Ser Arg Met Ala Ser Thr Arg Val Val Phe Thr lie His 
755 760 765 

Asn Leu Glu Phe Gly Ala His Tyr lie Gly Lys Ala Met Thr Tyr Cvs 
770 775 780 

Asp Lys Ala Thr Thr Val Ser Pro Thr Tyr Ser Arg Asp Val Ala Gly 

790 795 800 

His Gly Ala lie Ala Pro His Arg Glu Lys Phe Tyr Gly He Leu Asn 

805 810 815 

Gly He Asp Pro Asp He Trp Asp Pro Tyr Thr Asp Asn Phe He Pro 

820 825 830 

Val Pro Tyr Thr Cys Glu Asn Val Val Glu Gly Lys Arg Ala Ala Lys 
835 840 845 

Arg Ala Leu Gin Gin Lys Phe Gly Leu Gin Gin Thr Asp Val Pro He 
850 855 860 

Val Gly He He Thr Arg Leu Thr Ala Gin Lys Gly He His Leu He 

870 875 880 

Lys His Ala He His Arg Thr Leu Glu" Ser Asn Gly Gin Val Val Leu 

885 890 895 

Leu Gly Ser Ala Pro Asp His Arg He Gin Gly Asp Phe Cys Arg Leu 

900 905 910 

Ala Asp Ala Leu His Gly Val Tyr His Gly Arg Val Lys Leu Val Leu 
915 920 925 

Thr Tyr Asp Glu Pro Leu Ser His Leu He Tyr Ala Gly Ser Asp Phe 
930 935 940 

He He Val Pro Ser He Phe Glu Pro Cys Gly Leu Thr Gin Leu Val 
945 950 955 960 

Ala Met Arg Tyr Gly Ser He Pro He Val Arg Lys Thr Gly Gly Leu 

965 970 975 

Tyr Asp Thr Val Phe Asp Val Asp Asn Asp Lys Asp Arg Ala Arg Ser 

980 985 990 

Leu Gly Leu Glu Pro Asn Gly Phe Ser Phe Asp Gly Ala Asp Ser Asn 
995 1000 1005 

Gly Val Asp Tyr Ala Leu Asn Arg Ala He Gly Ala Trp Phe Asp Ala 
■ 1010 1015 1020 

Arg Asp Trp Phe His Ser Leu Cys Lys Arg Val Met Glu Gin Asp Trp 
1025 1030 1035 1040 

Ser Trp Asn Arg Pro Ala Leu Asp Tyr He Glu Leu Tyr His Ala Ala 

1045 1050 1055 

Arg Lys Phe 



<210> 11 
<211> 728 
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385 390 395 400 

Ala Asp Val lie Pro Pro Glu Lys Ala Leu Val Leu Asp Trp Val Phe 

405 410 415 

Ala Asp Gly Pro Ala Gly Asn Ala Arg Asn Tyr Asp Asn Asn Ala Arg 

420 425 430 

Gin Asp Phe His Ala lie Leu Pro Asn Asn Asn Val Thr Glu Glu Gly 
435 440 445 

Phe Trp Ala Gin Glu Glu Gin Asn lie Tyr Thr Arg Leu Leu Gin Glu 
450 455 460 

Arg Arg Glu Lys Glu Glu Thr Met Lys Arg Lys Ala Glu Arg Ser Ala 
465 470 475 480 

Asn lie Lys Ala Glu Met Lys Ala Lys Thr Met Arg Arg Phe Leu Leu 

485 490 495 

Ser Gin Lys His lie Val Tyr Thr Arg Thr Xaa Leu Lys Tyr Val Pro 

500 505 510 

Gly Thr Thr Val Asp Val Leu Tyr Asn Pro Ser Asn Thr Val Leu Asn 
515 520 525 

Gly Lys Ser Glu Gly Trp Phe Arg Cys Ser Phe Asn Leu Trp Met His 
530 535 540 

Ser Ser Gly Ala Leu Pro Pro Gin Lys Met Val Lys Ser Gly Asp Gly 
545 550 555 560 

Pro -Leu Leu Lys Ala Thr Val Asp Val Pro Pro Asp Ala Tyr Met Met 

565 570 575 

Asp Phe Val Phe Ser Glu Trp Glu Glu Asp Gly lie Tyr Asp Asn Arg 

580 585 590 

Asn Gly Met Asp Tyr His lie Pro Val Ser Asp Ser lie Glu Thr Glu 
595 600 605 

Asn Tyr Met Arg lie lie His lie Ala Val Glu Met Ala Pro Val Ala 
610 615 620 

Lys Val Gly Gly Leu Gly Asp Val Val Thr Ser Leu Ser Arg Ala lie 
625 630 635 640 

Gin Asp Leu Gly His Thr Val Glu Val lie Leu Pro Lys Tyr Asp Cys 

645 650 655 

Leu Asn Gin Ser Ser Val Lys Asp Leu His Leu Tyr Gin Ser Phe Ser 

660 665 670 

Trp Gly Gly Thr Glu lie Lys Val Trp Val Gly Arg Val Glu Asp Leu 
675 680 685 

Thr Val Tyr Phe Leu Glu Pro Gin Asn Gly Met Phe Gly Val Gly Cys 
690 695 700 

Val Tyr Gly Arg Asn Asp Asp Arg Arg Phe Gly Phe Phe Cys His Ser 
705 710 715 720 

Ala Leu Glu Phe lie Leu Gin Asn Glu Phe Ser Pro His He He His 

725 730 735 
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agcatcaggc agatagaacc caagcagtgg aaacgataac ttggaagaaa gttgatgagg 900 
aacatcttta catgactgaa catcagatag gtgctgccga aggacagatg gtagttaacg 960 
aggatgagct ttctataact gaaattggaa tggggagagg tgataaaatt cagcatgtgc 1020 
tttctgagga agagctttca tggtctgaag atgaagtgca gttaattgag gatgatggac 1080 
aatatgaagt tgatgagacc tctgtgtccg ttaacgttga acaagatatc caggggtcac 1140 
cacaggatgt tgtggatccg caagcactaa aggtgatgct gcaagaactc gctgagaaaa 1200 
attattcgat gaggaacaag ctgtttgttt ttccagaggt agtgaaagct gattcagtta 1260 
ttgatcttta tttcaatcgt gacctaacag ctttggcgaa tgaacccgat gttgtcatca 1320 
aaggagcatt caatggttgg aaatggaggc ttttcactga aagattgcat aagagtgacc 1380 
ttggaggggt ttggtggtct tgcaaactgt acatacccaa ggaggcctac agattagact 14 40 
ttgtgttctt caacggtcgc acggtctatg agaacaatgg caacaatgat ttctgtatag 1500 
gaatagaagg cactatgaat gaagatctgt ttgaggattt cttggttaaa gaaaagcaaa 1560 
gggagcttga gaaacttgcc atggaagaag ctgaaaggag gacacagact gaagaacagc 1620 
ggcgaagtaa ggaagcaagg gctgcagatg aagctgtcag ggcacaagcg aaggccgaga 1680 
tagagatcaa gaacaaaaaa ttgcagagta tgttgagttt ggccagaaca tgtgttgata 1740 
atttgtggta catagaggct agcacagata caagcggaga tactatcagg ttatactata 1800 
acagaaactc gaggccactt gcgcatagta ctgagatttg gatgcatggt ggttacaaca 1860 
attggtcaga tggactctct attgttgaaa gctttgtcaa gtgcaatgac agagacggcg 1920 
attggtggta tgcagatggt acgacacctc aacctttgta cataaggcaa cattgttttg 1980 
attttttttg ttgaggaaac atttgttttg attctagcat aatgctccta caaatatggc 2040 
atgaatttcc ttgttttatt gatgtcatga gaaagtattt tattaactcg aaggccatgg 2100 
aagctcaaca tttaccatag acagacgctt aaagatcatt tgtattccgt ggatcatata 2160 
tgtaatgtaa tacctgtctt ttctctatat gtacagttat tccacctgaa aaagcacttg 2220 
tgttggactg ggtttttgct gatgggccag ctgggaatgc aaggaactat gacaacaatg 2280 
ctcgacaaga tttccatgct attcttccaa acaacaatgt aaccgaggaa ggcttctggg 2340 
tgcaagagga gcaaaacatc tatacaaggc ttctgcaaga aaggagagaa aaggaagaaa 2400 
ccatgaaaag aaaggtgagt tgcaacaaaa tctttgcata tagatc 24 4 6 

<2i0> 13 

<211> 1032 

<212> DNA 

<213> Triticum sp. 

<400> 13 

gatctctata attttggcag ttaacccctg agtgatggca aatatattcc ctttcgtcta 60 
ttttccaaat tcaaaatgca tggttccatg caagcttatc caaaatcact tgataatata 120 
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<212> DNA 

<213> Triticum sp. 

<400> 11 

gatcttgaac ggcacgtgaa agacttgtaa caacatcccc gagacctcca acctatgaga 60 
tcatcgatca tgacagagca tagtattatg gcatagaatg aaaaaaaggc ataaggtgat 120 
gagatctcca cgccagagcg ttgtattcca attttagttc tttccccgtg aggaggggag 180 
gctaggcggg cgaggcagag gggatagggc agtcgccgct gcgtggtgga ctgactggtg 240 
tggtgggtgg tgggttttgc gggcggggtt tagtaggttc ccggaaatgg agatggctct 300 
ccggccacgg agccctctgt gccctcggag cagtcagccg ctcgtcgtcg tccggccggc 360 
cggccgcggc ggcggcctcg cgcaggtacg ggtgattatg gttcttgatt cggtcggttc 420 
acggaatgtt gtttgatttg gttctgtccc gggtcaggtt catagtgatt ttattccgca 480 
aaaaaaaaag gtttatagtg attttgattt ctttcatctc gggaacattt ttatatctgg 540 
gagtcaaagg gcattggttt tgatttgcat gcggaacata ttggttattt attaatgtgg 600 
tgagctggaa ttcatactgc ttaaaacgac gtgattttaa ttgctggaag aggtaaagaa 660 
catgaattct tgttatattt gttaaaaaaa atcccctgtt ctagcgtttc aatctgcatg 720 
atcatgga 728 

<210> 12 

<211> 2446 

<212> DNA 

<213> Triticum sp . 

<400> 12 

gtgggtctat aaaagacagg tttgagcgga ttcgtcagga aatgtttcaa caagtgcgac 60 
gatgtgggat gcaattgatg aaaccgtggc ttgatcaaga cgcagttgag gcggatttgt 120 
cgggaaatgc ttcaagctgc gcgacataca gagaagtgga tgatgtggtg gatgaaacta 180 
gatcagaaga ggaaacattt gcgatggatt tgtttgcaag tgaatcaggc catgagaaac 240 
atatggcagt ggatcatgtg ggtgaagcta ccgatgaaga agagacttac caacagcaat 300 
atccagtacc gtcttcattc tctatgtggg acaaggctat tgctaaaaca ggtgtaagtt 36*^ 
tgaatcctga gctgcgactt gtcagggttg aagaacaagg caaagtaaat tttagtgata 420 
aaaaagacct gtcaattgat gatttaccag gacaaaacca atcgatcatt ggttcctata 4 80 
aacaagataa atcaattgct gatgttgcgg gaccgaccca atcaattttt ggttctagta 540 
aacaacaccg gtcaattgtt gctttcccca aacaaaacca gtcaattgtt agtgtcactg 600 
agcaaaagca gtccatagtt ggattccgta gtcaagatct ttcggctgtt agtctcccta 660 
aacaaaacgt accaattgtt ggtacgtcga gagagggtca aacaaagcaa gttcctgttg 720 
ttgatagaca ggatgcgttg tatgtgaatg gactggaagc taaggaggga gatcacacat 780 
ccgagaaaac cgatgaggat gtgcttcatg taaaatttaa tgttgacaat gtgttgcgga 840 
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•ttctgattca attgaaacag agaattacat gcgtattatc cacattgccg ttgagatggc 840 
ccccgttgca aaggtaatat aattctaagg ctagtttctt tgatgcgagg eg 892 

<210> 15 
<211> 871 
<212> DNA 

<213> Triticum sp. 
<400> 15 

aggttatcct ccagaatgaa ttttttccag tacgtattat ttagaatact agcggtatat 60 
tgactttttc tttgtgagac tacactttct tgtttaccat tccagtgcac catgttcaaa 120 
atcttgtatt cagcgcgtta ctttcagttt ctttactact agcttatttg gtgcattggt 180 
gtttcctttc ctactctact atctgaatgc tacttgtgtt ttcgcaacag ttgcttcttt 240 
atccccttcc atttctcagt taaaaaaact tgcatctgta ttcacgtgac agcatataat 300 
acattgccat gattggtcaa gtgctccggt cgcctggcta tataaggaac actattccca 360 
atccagaatg gcaagcactc gggttgtatt taccatccac aatcttgaat ttggagcaca 420 
ttatattggt aaagcaatga catactgtga taaagccaca actgtgagtg ccttactgtc 480 
ttgtaatttt taatctttct gtttggcgca cagaaaatct tccacatttt acagaatcat 540 
gttcttgtgt tttgtacgta ttcaactatt tccacccaaa cttttcaggt ttctcctaca 600 
tattcaaggg acgtggcagg ccatggtgcc attgctcctc atcgtgagaa attctacggc 660 
attctcaatg gaattgatcc agatatctgg gatcctgatt gccaacatgc tgtttggtcg 720 
tctcgaggtc tttacattgc tggtgctctt taccccgact ttctggcgtg aatgatggag 780 
taatacgtga aaacattaat tcttttctca acaagggacg gacaaacgcg cgagattgcc 840 
tcctacctgg cttcggaact gaaagaactg g 871 

<210> 16 

<211> 1592 

<212> DNA 

<213> Triticum sp, 

<400> 16 

cgggaattct cgatcccgtg gctaactcct tagtcacatt gagctcatta tgatgatgca 60 
ttaccgagtg ggcccaaaga tacctctccg tcatacggag tgacaaatcc cagtctcgat 120 
tcgtgccaac ccaacagaaa ctttcggaga tacctgtagt gcacctttat agtcacccag 180 
ttacgttgtg acatttgata cacccaaagc actcctacga tatccgggag ttgcacaatc 240 
tcatggtcta aggaaatgat acttgacatt agaaaagctc tagcgaacga actacacgat 300 
cttgtgctat gcttaggatt gggtcttgtc catcacatca ttctcctaat gatgtgatcc 360 
atacactgac aattttatcc cggtaccaga ttttttccca gagtgcaagt agatatatac 420 
caaggccaca gatagtttta tgcttaacta tgtgtttcat actacttcag gtcccttata 480 
cttgtgagaa tgttgtcgaa ggcaagagag ctgcaaaaag ggccttgcag cagaagtttg 540 
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ccaatcacaa cataactttg tttaccataa gaacattcct acttaaaatt tgcaaggtaa 180 
ctccctttcg aggctggttg gcttgatgag taactggcaa ttaacaaaga aaagatatat 240 
ctgatgtttg gaacaaaaca tatgatcagg gttgtttggg ttgactcatg ttccttttta 300 
cctacacagg ctgagagaag tgcaaatatc aaagctgaga tgaaggcaaa aactatgcga 360 
aggtttctgc tttcccagaa acacattgtt tataccgaac cgcttgaaat acgtgccgga 420 
accacagtgg atgtgctata caatccctct aacacagtgc taaatggaaa gccggaggtt 480 
tggtttagat gctcttttaa cctttggatg catccaagtg gagcattgcc accccagaag 540 
atggtgaaat caggggatgg gccgctctta aaagccacag gtttattgcg ttattacatc 600 
actgttatta gtatatatat aaccattttt atgcaatcaa tagagtcaag tgcaactaat 660 
gatgcacaga taggatcaca tcattaggag aatgatgtga tggacaagac ccaatcctaa 720 
gcatagcaca agatcgtgta gttcgttcgc tagagctttt ctaatgtcaa gtatcatttc 780 
cttagaccat gagattgtgc aactcccgga tatcgtagga gtgctttggg tgtatcaaat 840 
gtcacaacgt aactgggtga ctataaaggt gcactacagg tatctccgaa agtttctgtt 900 
gggttggcac gaatcgagac tgggatttgt cactccgtat gacggagagg tatctttggg 960 
cccactcggt aatgcatcat cataatgagc tcaatgtgac taaggagtta gccacgggat 1020 
cgagaattcc eg 1032 

<210> 14 

<211> 892 

<212> DNA 

<213> Triticuzn sp. 

<400> 14 

aatatttctt gttctattat tggtaataat tagctagttt aatgccataa gcccataaca 60 
gatatgcaac tactccctcc aatccatatt acttgtcgca actttggtac aactttagta 120 
caaagttata ctaaagctgt gacaagtaat atggaccgga gggagtacta tataagcttg 180 
tagctgtttt gagaccgagt gtctgctcgg gtggctagct ggagcgggct gaagtgcttg 240 
caggcacctc ttctcta ^aa aaaagtgctt gcagcccccc cgccccctcc atagggtgag 300 
tggtcacctt tcttcttaaa aattatggca ccaagggaaa ttctcggctg gtcgagcttg 360 
tagctatttt ttcggagcgt gaatgggagc gtctttctgt ataaggccta taggcttact 420 
ttgatatata ttgtgaagtc acttaagcct tgttaaaacg tagaaactta gttccgcaac 480 
ttggccaaat ccctgttaaa ttggtttact gtgtactaga tgcatcgatg gcgcagagtc 540 
ccggggggta ataaagcttc cattttctac aatgaagtta attatcctac ttgccttgta 600 
attactgagt acaatacaga gcaccgaaaa gctgtatcct tcctacttcc ttatgtttat 660 
ctgtgttcct tgtctagtta atgttccacc ggatgcctat atgatggact ttgttttctc 720 
cgagtgggaa gaagatggga tctatgacaa caggaatggg atggactatc atattcctgt 780 
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<211> 10 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE MOTIF 
<400> 19 

Ala Thr Gly Lys Lys Asp Ala Gly lie Asp 
15 10 



<210> 20 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE MOTIF 
<400> 20 

Ala Leu Gly Lys Lys Asp Ala Gly lie Asp 
15 10 



<210> 21 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 21 

Ala Thr Gly Lys Lys Asp Ala Leu 
1 5 



<210> 22 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 22 

Ala Leu Gly Lys Lys Asp Ala Leu 
1 5 



<210> 23 
<2ll> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE MOTIF 
<400> 23 

Ala Ala Gly Lys Lys Asp Ala Arg Val Asp Asp Asp Ala Ala 
1-5 10 
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gattacagca aactgatgtc cctattgtcg gaatcatcac ccgtctgaca gcccagaagg 600 
gaatccacct catcaagcac gcaattcacc gaaccctcga aagcaacgga caggttcatc 660 
atcccttgtg aacgaataaa catcaaacgt tttgtttata aaaagttgct tactatttgt 720 
ttttgtttac ttcaaaacaa aagtctgaaa atgaagtgtt tggttcctag gtggttttgc 780 
ttggttcagc tccagatcat cgaatacaag gcgatttttg cagattggcc gatgctcttc 840 
acggtgttta ccacggtagg gtgaagcttg ttctaaccta cgatgagcct ctttctcacc 900 
tggtgagctc caatatccta cacaccatct agccagccct tcattatggg agctggagac 960 
tactttataa tttaggttga tgatcgatca tgctgcagat atacgctggc tccgacttca 1020 
ttattgtccc ttcaatcttc gaaccctgtg gcttaacaca acttgttgcc atgcgttatg 1080 
gatcgatccc tatagttcgg aaaaccggag gtgtgtgact atttctctcc attatgctgc 1140 
actgatttgc atatgtcgag ctgttggaca tgaaatggaa actatccttt ggtatcgcag 1200 
gactttacga cactgtcttc gacgtagaca atgataagga ccgggctcgg tctcttggtc 1260 
ttgaaccaaa tgggttcagt ttcgacggag ccgacagcaa cggcgtggat tatgccctca 1320 
acaggcaagt atcgttcctc aattagccct gaattcagca gtagtgctag gttatttacc 1330 
ttgcatgttc catacctcat ttcagagcaa tcggcgcttg gttcgatgcc cgtgattggt 14 40 
tccactccct gtgtaagagg gtcatggaac aagactggtc atggaaccgg cccgcactgg 1500 
actacattga attgtaccat gccgctcgaa aattctgaca cccaactgaa ccaatggcaa 1560 
gaacaagcgc attgtgggat cgagaattcc eg 1592 

<210> 17 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 17 

Asp Val Gin Leu Val Met Leu Gly Thr Gly 
15 10 



<210> 18 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 18 

Ala Ala Gly Lys Lys Asp Ala Gly lie Asp 
15 10 



<210> 19 
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<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 29 

ggaggtcttg gtgatgttgt 

<210> 30 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 30 

cttgaccaat catggcaatg 



<210> 31 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 31 

cattgccatg attggtcaag 



<210> 32 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 

<400> 32 

accacctgtc cgttccgttg c 



<210> 33 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 33 

gcacggtcta tgagaacaat ggc 

<210> 34 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: PRIMER 



<400> 34 
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<210> 24 
<211> 13 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 

lll^L^Gly Lys Lys Asp Ala Gly He Val Asp Gly Ala 
1 5 10 



<210> 25 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 25 

tgttgaggtt ccatggcacg ttc 



<210> 26 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence : PRIMER 
<400> 26 

agtcgttctg ccgtatgatg teg 



<210> 27 
<211> 21 

<212> DNA ^ 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: PRIMER 
<400> 27 

ccaagtacca gtggtgaacg c 



<210> 28 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

^ • oi^ Artificial Sequence: PRIMER 

<223> Description or Art iri^ -'^^^ 

<400> 28 

cggtgggatc caacggccc 



<210> 29 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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caatgcttct cgttcggttt atcccctagc acgcacgaac gcatgtgccg caccgcgtca 780 
cgcaacgcat gcatgcacaa accaacaaac gaaacagtgc agttgcagtg ctctatctac 840 
atatacgcaa tcaacgcggg cctcctcctt cgccgcgagc cccgttccgt cctcggtctt 900 
cacgtggatt ttgcaacttc cttccagcag cttgtcacca cggacgcttc ctctctgaca 960 
actggccccg tgggcggaac ggggcctccg ctcgcccctt gcgaaaccca cggctcgtcc 1020 
gttcgcttct ctagcgggca ccgacagaag gggccggcgc agggtaggac caggctgtca 1080 

4 

gctggtgagg agcctgccgc tcgttgtgcc gcagctggag accgagcggg gcaacggaac 1140 
ggctgccgcc ctcgtgtgct gctcgcgtgg cacgccgcaa cggcaccggg cccgctttcc 1200 
agcgtgctcg cccgcaaacc gcagacccaa cacgccagcc gccagggggc cgttcgtacg 1260 
tacccgcccc tcgtgtaaag ccgccgccgt cgtcgccgtc ccccgctcgc ggccatttct 1320 
tcggcctgac cccgttcgtt tacccccaca cagagcacac tccagtccag tccagcccac 1380 
tgccaccgcg ctactctcca ctcccactgc caccacctcc gcctgcgccg cgctctgggc 1440 
ggaccaaccc gcgaaccgta ccatctcccg ccccgatcca tgtcgtcggc ggtcgcgtcc 1500 
gccgcatcct tcctcgcgct cgcgtcagcc tcccccggga gatcacgcag gcgggcgagg 1560 
gtgagcgcgc agccacccca cgccggggcc ggcaggttgc actggccgcc gtggccgccg 1620 
cagcgcacgg ctcgcgacgg agctgtggcg gcgctcgccg ccgggaagaa ggacgcgggg 1680 
atcgacgacg ccgccgcgtc cgtgaggcag ccccgcgcac tccgcggtgg cgccgccacc 1740 
aaggtagtta gttatgacca agttatgacg cgtgcgcgcg cctcgagatc atcgtcgtct 1800 
cgctcacgaa ttgtttattt atacaaaacg cacgcccgcg tgtgcaggtc gcggagcgaa 18 60 
gggatcccgt caagacgctc gaccgcgacg ccgcggaagg cggcgggccg tccccgccgg 1920 
cagcgaggca ggacgccgcc cgtccgccga gtatgaacgg catgccggtg aacggcgaga 1980 
acaaatctac cggcggcggc ggcgcgacta aagacagcgg gctgcccacg cccgcacgcg 2040 
cgccccatcc gtcgacccag aacagagcac cggtgaacgg tgaaaacaaa gctaacgtcg 2100 
cctcgccgcc gacaagcata gccgaggccg ^ggcttcgga ttccgcagct accatttcca 2160 
tcagcgacaa ggcgccggag tccgttgtcc cagctgagaa gacgccgccg tcgtccggct 2220 
caaatttcga gtcctcggcc tctgctcccg ggtctgacac tgtcagcgac gtggaacaag 2280 
aactgaagaa gggtgcggtc gttgtcgaag aagctccaaa gccaaaggct ctttcgccgc 2340 
• ctgcagcccc cgctgtacaa gaagaccttt gggatttcaa gaaatacatt ggtttcgagg 2400 
agcccgtgga ggccaaggat gatggccggg ctgtcgcaga tgatgcgggc tcctttgaac 24 60 
accaccagaa tcacgactcc ggacctttgg caggggagaa tgtcatgaac gtggtcgtcg 2520 
tggctgctga gtgttctccc tggtgcaaaa caggcatgga cattacctct tcagtctctc 2580 
ttcctgttgt tcataaaact ttgctcgaat tactcataag aacaaacatt gtgttgcata 2640 
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tctgcatacc accaatcqcc q 

21 

<210> 35 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 35 

Lys Val GXy Gly Leu Gly Asp Val Val Thr Ser Leu Ser Arg Ala Val 



10 15 



Gin Asp Leu Gly His Asn Val Glu Val 

20 25 



<210> 36 
<2ll> 25 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PEPTIDE MOTIF 
<400> 36 ' 
Lys val Gly Gly Leu Gly Asp Val Val Thr Ser Leu Ser Arg Ala He 
^ 5 10 

Gin Asp Leu Gly His Thr Val Glu Val 

20 25 



15 



<210> 37 

<211> 9024 

<212> DNA 

<213> Triticum sp. 

<400> 37 

aaatatgaaa ccaaaaaaaa aatagaaaaa ggaaaggtaa aatagaaagt taaataggaa 60 
taatggataa aaaataaaac atcaaagaaa aacgaaatgc agaagaaaaa aacgtcactt 120 
gttcccttat tatctcccgt gcaccccggt agcgtaggac aaaaagaaaa aatagaacgg 180 
acccaacgtc acaagctcac acatgcccag cgagagaaaa gaaaaatggt gcgacaaaaa 240 
aaaggaaacg ggctgagagc cgaaacacat gggctgcgct ttgttcgcta cgaagctctc 300 
ccctcgacaa aatatgaatc gcgacgtgat tggatcctat ggtggaaaaa gtgaatgaga 360 
ccaaaagaat tctcagctga atgagtttta gcaagactga tcattatatc caacataaat 420 
agattttttt tttgcaaaaa taatccaaat ctattagcaa agttcagtag aagtacaaag 480 
catctcgaac attataaaca ttacactgag attccaggac caccaaacaa cccactactg 540 
ccgcgaaaag aaaaggattc ggaagacaga aattatccaa accacgttcg tccttggttg 600 
ttggtctcat tgcgcgctaa acaacctgga cagcagaaga agcaaagcag tgtgcttccg 660 
ctccgcagca agaagacaag tcgtcacatg tcagacgccg tcactcaagc aagcaaactg 720 
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tatttactcc acattgtttg aggaaactaa 
agttatagtc ttatagaggt aaatgcacca 
tttggtgctt acagttgtag actatgaaaa 
tacggtgcat tttccgtatg taggagtcaa 
ctatagctgt tagaccgtgc ctacgtcgcc 
gggccccact tgtcaaccta tgacataaat 
tggggtcttg aaaatgggac ctcgcaggta 
tcccctatgc acttcatgtc ttgtgtatgt 
atgctgtttt tctttggttc aaggctacca 
ggccagcgcc ttcatgatgg cccaagtgct 
tgtcatgacc atccaccaac ccaacacaca 
gcccctcttg tcctttcccc tcgtacccaa 
agttgtgacc atcgcctgcg tcgcctcata 
cctacttggg agcccatacc tccctgcaca 
atcgcaaacc aacttctcct ctccttctcc 
gcaatacatg ccgagttggc catggcccta 
ctctaagcct agcacctttt cccctcacca 
cctacgtcgg ctgcagttgc ctgccgcctc 
tcggcgacat ctcctcgacc tcccattcca 
catccatgtg aaccgaatca tcatagaact 
cactgttcct ctattccccc caagccgtgt 
atcccttggg tcatcggttc aatggctatt 
cacacgccac actaagccct ttctttctcc 
tacttagcca gagagagaac atgagcttgt 
tcttaacggc tacaaacaac ggatatggtg 
tcatactgca tgcgagagcc agagccaggt 
gcgagcatca aagtgtacat atgccgaacc 
ggtgcggtgg gtggctcaaa gacaccccaa 
cggtgccgaa ccatattgaa gtggtgaggt 
ggatgaggga cataaaggat ctcataaata 
gcgaagcgct tcatgatttc catctcccct 
atttctcagg tcgcttctcg tctaaatccg 



-48 - 



acgggttgca 


aaattatgat 


ggcttatgaa 


4 680 


gtggtgcttg 


aacttgtcac 


gcgtgttcac 


4740 


acgggtgcaa 


aaacttgctg 


t tgtgtgcca 


4800 


acgttgccta 


tgtgggcatt 


gtattcccgt 


4860 


attgggccca 


cacactctct 


atttacatgt 


4920 


aaatggaaat 


ttataataaa 


aatgatggcc 


4980 


tgctggtagc 


cagcacgccc 


taaacattaa 


5040 


gtgtgtctgt 


gtggggaggg 


gggggtatgc 


5100 


tgctcaacaa 


gcccacctcc 


gcttcaacac 


5160 


ccgcaccatc 


gctcaaagcg 


gcaacgtcgt 


5220 


aaatcctcaa 


catccgcaaa 


tagtgagcat 


5280 


acatgtcttg 


ataacccttg 


gagctgcaca 


5340 


gagcccgacc 


tagccggacc 


gttatagaag 


5400 


tcctcctctt 


tccccataga 


tcgtgccgcc 


5460 


cactctggcc 


gtttcccccg 


ccgcgaagct 


5520 


ttccccaatt 


gctcgcacta 


ggaggtcctc 


5580 


attgcaagtt 


ggggagcccc 


tcgcgagctc 


5640 


aactctgatc 


cagacctcgt 


tcccgtggcc 


5700 


cacgtggcct 


ggcgaggatc 


accgcatgtt 


5760 


aacaccggag 


aggtcatccc 


gacggcgtcg 


5820 


cgcgtcataa 


tataagacgg 


acttatttgt 


5880 


tctttctcct 


gtctactgat 


aagtgggacc 


5940 


tacccgttga 


taagtgggac 


ccacacacag 


6000 


tggtgccacg 


tcggcaagcc 


atgtcagcag 


6060 


tcacgtgagc 


gtttacgaat 


ggaaagtgca 


6120 


ttttgcacca 


gttttctgta 


ttttacaact 


6180 


aaagtgaaca 


tggtgagtcc 


attcttttct 


6240 


tagaagctat 


tgcctccgac 


attgccaatt 


6300 


cagttgcttg 


tgctatgact 


actaggtatt 


6360 


ttgcaatgtt 


cattcaaatt 


cttaacattt 


6420 


agatcagaga 


cacttggtcg 


tgtacactga 


6480 


catatgtagc 


tcacttcaat 


gacttgcctt 


6540 
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ggtggtctgg gagatgttgc gggtgctctg 
gttatggtac tacaagcttt catttaactc 
tgagtagtat aatgttatta agtgcaagac 
gtaaattaat ataagagcgt ttagattact 
tttacagacg gagtagagta tttcatagcc 
tgggtggggg agggggtttg aaacaagtgg 
ggctgataac cacaccatca gtgaaggaat 
ccaacgtcgg gtttacccgc cctatagatc 
accaaatatc gccagcgccc gtgtgtgtat 
tttccggtta atggtttcta tcatattcac 
aatgtagatg atggataaat gtatgttgtc 
gagctagttt cgcggttcgg ttagagccat 
gtgagagagg gttttgggga gttaactttc 
ccagtaaaga gtaaactatt ttctgcaggc 
gaaaatagtt atggtatcat ataaaccata 
gctagacttt gataatctga aattttaaat 
tcttaggttg tggtaccaag gtatggggac 
aaatactaca aggctgctgg acaggtaagc 
tattgcttat tgtcataata aatcaatttt 
aagtgaatta tttccatgct tatatcgatg 
tcttccgaca ccgtcaggaa gacatttatg 
ttggtgtttg attgcactga taaactgaga 
ttacacattt tattttttca ggaaattatg 
gttgaggtat ctctccaact caattgacaa 
atgtatttca acagatacat aatctcttgt 
accttacatg cacatttggt caagcgttat 
tgaacaatta tcttgatgat ccttgttact 
gcgaattgat ttggaaatag catttccacc 
tcatccaatt tagatatttt cgtacttggc 
catttttatt cctctataat ttgcaggttc 
atggggatgg aaatctggtg tttattgcaa 
atctgaaagc atattacagg gaccatggtt 
tacataacat cgctcaccag gttccttttc 



-47- 

cccaaggctt tggcaaagag aggacatcgt 2700 
tgttgggtcc atatgttcga ataatatcag 2760 
atgaaagtgt tcttctgtca tactccctcc 2820 
actttagtga tctaaacgct cttatagtag 2880 
aaccctggag gttaggttgc tgaggcctac 2940 
tggttagcag ccagatttca caaagaagga 3000 
gaatgtcggg tacccgatcg accgttttgc 3060 
cgaataagta gttcctatct tcaattaggt 3120 
ttatactact ggatgatcaa tttatcaaca 3180 
tgtaattgtt agtaaacagt agatgtttgt 3240 
gagctttcat ttcaatgcaa ttttgattgg 3300 
caaaacccca gaatttttgg gagttggctt 3360 
gggattcagt tagagacgct cttactagtt 3420 
atcccaatta ttctgtagaa attagaagtg 34 80 
tattattcaa aatctagaat catggacttg 3540 
ttgatgataa ttgagaaatg atcctttcta 3600 
tatgaagaag cctacgatgt cggagtccga 3 660 
aaaaatgcaa tcgaagggga gctgaaattt 3720 
taagtgtttt ttttgtcctg caggatatgg 3780 
gagttgattt tgtgttcatt gacgctcctc 3840 
ggggcagcag acaggttaat cttctatatg 3900 
acaagccaag gcctactgac tggcatatga 3960 
aagcgcatga ttttgttctg caaggccgct 4020 
cctattacca ctatacaatt atgtgtatgc 4080 
gaagtgcata tatactaata acatttcaat 4140 
gatttaactt ctgataatct attgcactga 4200 
tcatcgttat gtttccatgt tctcttcacc 4260 
tgccacaaac aataatatac actcctactt 4320 
atatcatccc attaaatatt attggtccat 4380 
catggcacgt tccatgcggc ggtgtccctt 4440 
atgattggca cacggcactc ctgcctgtct 4500 
tgatgcagta cactcggtcc attatggtga 4560 
tcctaatctt gatttttctc tagtctctac 4620 
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agctagctca catgtctgat ggatgcacgt 
cggcaagaat gggaagtgaa ttcctccctg 
cagttaaaac aatagcactt cgagtggaag 
tggactcata gcatgttacc aaaaaatgcc 
ccatcaacat ttgaacctat acaaactaga 
ccagatacac aggtgccaaa gggctacaac 
caagtgaagg caacaagcat cactacggag 
gaagagttga agttgtgatt tgacgaaacc 
acacgtcacc gtccaatcca aaga 

<210> 38 
<211> 11611 
<212> DNA 

<213> Triticum aestivum 
<400> 38 

taatccgttt gtctaatgaa atatatgtga 
cacccccccc ctatatgagt attaaattca 
acaattttgg gatactaaac ctggatgctc 
aaatatcagg aaacgtattc tcagtaaaaa 
ctgtttgggt ggatcatagg tcagactata 
ttttcacaaa acttcacatg agagtagatt 
caagttcttt cgaattttcc tagtattttt 
tccaggagct ctggtgtatt .tttcaatata 
acaatctcag aaaaaactgg acggtttatt 
tcagaagtgg ccctcagccc ctcactcttc 
tcctgcacga acattcgcgt tgaagttttt 
agaaagcaag tacaaaaaac accagccatc 
ttccatgtgt gcgcacacgg agaagcagct 
tcgaagctgc tctcggacaa aatggttgaa 
tccacgccag agcgttgtat tccaatttta 
cgggcgaggc agaggggata gggcagtcgc 
gtggtgggtt ttgcgggcgg ggtttagtag 
acggagccct ctgtgccctc ggagcagtca 
cggcggcggc ctcgcgcagg tacgggtgat 
tgttgtttga tttggttctg tcccgggtca 



-50- 

gccatggttg gtttggtagc gcagtgcaaa 8580 
cttgaattag cactttcagt aataatcagt 8540 
tgaacaagaa aaccaacatc acacccggta 8700 
tttcgccccg ctgtatatat aaagcaacga 8760 
acacaccact caaaacccac acactcaggg 8820 
cacaacacac cgaaagactc acatagacta 8880 
cctccggcgt ccttccgatg aagaaatcat 8940 
gtgcgctcca aaacggtgcc ttcaggaagg 9000 

9024 



tgggagagga tttggagcat tggggtgctc 60 
aaaaacaaac cgaggatatt caaaaagtct 120 
agtctactcc catgtgaagt ttcatgaaaa 180 
cagacaaaaa attcttatgc acagaaaaaa 240 
ttttcttcca tggatacatg tcatggtatt 300 
tgggcatcca agtttgatat ccccatattc 360 
tgaaattaat attcgtatag gggtggagca 420 
tgtatggtta tttaaaaaaa aactcgtaca 480 
ctagctgatt ttgtgtgcag tttcccataa 540 
ttcctcctac cttctgctct gtcttccgct 600 
tcaaaagaaa acaatatact tgctggaaaa 660 
caccaccgtc cgttactggt ccacctgcat 720 
cgaacaaaaa aaccaaacga aaataaagga 780 
ggacgaagga gcctttttgg tgcgcagatc 840 
gttctttccc cgtgaggagg ggaggctagg 900 
cgctgcgtgg tggactgact ggtgtggtgg 960 
gttcccggaa atggagatgg ctctccggcc 1020 
gccgctcgtc gtcgtccggc cggccggccg 1080 
tatggttctt gattcggtcg gttcacggaa 1140 
ggttcatagt gattttattc cgcaaaaaaa 1200 
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tttt tcatat 


acrctcact ct 

\^ ^ \m W \^ ^ 




6600 














6660 

www w 




4* ^ -a ^ a 

u c cccgaaga 


+" ^ 3 ^ 




^ u ugu^c^wa L. 




67 20 


^ ^ 4^ ^ ^ 

uQuCuCuCCa 


egg ucgcccc 


^ n ^ a ^ o 2 ^ 
agccauagau 












tgccaccutc 


ggacaagaag 


cugcg uag u u 


ugg ucg uc t. u 


aac ugcu ugg 


p ii n 
o o ^ u 


ttgatttgga 


aggaacacaa 


caacaguccc 


ugaaggcaaa 


gccaacccc u 


ucgarcaagr 




tattagacgg 


atcaagtgtg 
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gtcactattt 


ggctaggtcg 


cttgccatcc 


cgctctgtgc 


taagcgcttg 


gggtcgcttt 


7020 


tgctcaattt 


gtattttgtt 


gttatgtgtt 


tttagtaatg 


taacctgaac 


tttctggact 


7080 


aagtagaaaa 


aaattctcct 


ccataatgat 


cacatacagt 


tctcctgcat 


ggttcgaaaa 


7140 


aaaaatgaga 


acatccgtgg 


caagtttaag 


caccaccggt 


gcatttttac 


ctcaaagtta 


7200 


tatacaacac 


tgacatgccg 


aattacatgc 


tttggtcagt 


tattccattc 


ttcggtactc 


7260 


cgttgggcta 


attctttctc 


ttcatgttgc 


atgcagggcc 


gtggccctgt 


agatgaattc 


7320 


ccgttcaccg 


agttgcctga 


gcactacctg 


gaacacttca 


gactgtacga 


ccccgtgggt 


7380 


ggtgaacacg 


ccaactactt 


cgccgccggc 


ctgaagatgg 


cggaccaggt 


tgtcgtggtg 


74 40 


agccccgggt 


acctgtggga 


gctgaagacg 


gtggagggcg 


gctgggggct 


tcacgacatc 


7500 


atacggcaga 


acgactggaa 


gacccgcggc 


atcgtcaacg 


gcatcgacaa 


catggagtgg 


7560 


aaccccgagg 


tggacgccca 


cctcaagtcg .gacggctaca 


ccaacttctc 


cctgaggacg 


1 C O rt 

7 oZO 


ctggactccg 


gcaagcggca 


gtgcaaggag 


gccctgcagc 


gcgagctggg 


cct gcaggtc 


T c Q n 


cgcgccgacg 


tgccgctgct 


cggcttcatc 


ggccgcctgg 


acgggcagaa 


gggcgtggag 


1 / hu 


atcatcgpgg 


acgccatgcc 


ctggatcgtg 


agccaggacg 


tgcagctggt 


gatgctgggc 


"7 Q n n 


accgggcgcc 


acgacctgga 


gagcatgctg 


cggcacttcg 


agcgggagca 


ccacgacaag 
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/ o o u 


gtgcgcgggt 


gggtggggtt 


ctccgtgcgc 


ctggcgcacc 
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gggggcggac 
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tgccctcccg 


gttcgagccg 
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gcacgccgtt 


ggcggcctca 






804 0 


s ^ ^ 4 

gaccccucca 


accaccccgg 


gctcgggtgg 


acgttcgacc 






8100 


aucgaggcgc 


ccgggcac ug 


cctccgcacc 


taccgagact 






8160 


ctccaggagc 


gcggcatgtc 


gcaggacttc 


agctgggagc 


acgccgccaa 


gctctacgag 


8220 


gacgtcctcg 


tcaaggccaa 


gtaccagtgg 


tgaacgctag 


ctgctagccg 


ctccagcccc 


8280 


gcatgcgtgc 


atgacaggat 


ggaactgcat 


tgcgcacgca 


ggaaagtgcc 


atggagcgcc 


8340 


ggcatccgcg 


aagtacagtg 


acatgaggtg 


tgtgtggttg 


agacgctgat 


tccaatccgg 


8400 


cccgtagcag 


agtagagcgg 


aggtatatgg 


gaatcttaac 


ttggtattgt 


aatttgttat 


8460 


gttgtgtgca 


ttattacaat 


gttgttactt 


attcttgtta 


agtcggaggc 


caagggcgaa 


8520 
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aaaggtttat 


agtgattttg 


atttctttca 


tctcgggaac 


atttttatat 


ctgggagtca 


1260 


aagggcattg 


gttttgattt 


gcatgcggaa 


catattggtt 


atttattaat 


gtggtgagct 


1320 


ggaattcata 


ctgcttaaaa 


cgacgtgatt 


ttaattgctg 


gaagaggtaa 


agaacatgaa 


1380 


ttctgttata 


tttgttaaaa 


aaaatcccct 


gttctagcqt 


ttcagtctgc 

7 ^ 


atgatcatgg 


1440 


aaatat taat 


gttaatqctg 


gttaatttgg 


agtgaagatt 


tccacggcaa 


gagtttcgaa 

WW WW'^ 


1500 


caagaaacag 


aaat tcatta 


cgaaaaaatg 


gtggaacgaa 


t tcggagagt 


att tacattg 


1560 


t.ctacacc'bti 


atatatttQt 


gatgaagtta 


tttccatata 


ttttttgcga 


taaagttact 


1620 


t cccT'taliota 


aoQcaaa cat 


taccatcttt 


ctataaQcto 


atatttatct 


gccagat age 


1680 
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ttccactaat 


attttttaac 


Qaaacaaaac 


t atgaacaco' 


1740 
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^ X w V/ 
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^ ^ A ^ a 9 ^ A A 

ugacao. ugaa 


uac ug cag u u 
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^uauguuugg 








W B Vv WW ^ W ^ M 


caaocjcct t ti 

(A V4 w www 


2340 




•t- r* A fTl"t- t* 

Cl^ k« L» L> w l« w 




b» W w 


ttttttaata 


ctacaaaaac 


2400 


w O w u M ^ ^ w 


tcrt'ricra't" flCt* 


a'tacT'tti ta'ta 

^4 W V4 ^» ^ w w 


a'tac'tacaat 


tt ttattaca 


gccaaacacc 


2460 


^caaaQt.a'tti 


't a a aac ca ti a 


atttt taqaa 


aaact gtagt 


at ccttgaaa 


tactttgaga 


2520 


atactttaca 


acaaaacaca 


gcccagatgt 


tctgttaact 


tcatgtcttt 


ccaaattgca 


2580 


tcattcacrat 


cctcctaata 


ggaaatcaag 


aaagatggta 


tcacctcagg 


ttaaagtcat 


2 64 0 


ttcttctaga 


QcratatiacQa 


caagactcat 


tgttgaacca 


agcaccgaga 

7 ^ ^ 


atatagaaca 


2700 


caataatcgg 


aataaacraaa 


ctcttgatac 


atacaatgcg 


ctattaagta 


ccgagacagc 


2760 


agaatggaca 


gatactagag 


aagccgagac 


tgctaaagcg 


gactcgtcgc 


aaaacgcuC u 


•5 Q 5 n 


aagcagttct 


ataatcgggg 


gagtggatgt 


ggcggatgaa 


gatatacttg 


cggctgatct 


2880 


gacagtgaat 


tcattaagca 


gtataacgaa 


gaaggaagtg 


gatgcagtgg 


acaaagctag 


2940 


agttaaagaa 


gacgtatttg 


agctggattt 


gccagcaact 


acattgagaa 


gtgtgatag t 


3000 


ggatgtgatg 


gatcataatg 


ggactgtaca 


agagacat tg 


agaagtgtga 


tagtagatgt 


3060 


gatggatgat 


gcggcggaca 


aagctagagt 


tgaagaagac 


gtatttgagc 


tggatttgtc 


3120 
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aggaaatatt tcaagcagtg cgacgaccgt ggaactagat gcggttgacg aagtcgggcc 3180 
tgttcaagac acatttgagg cgaactcgtc aggaaatgtt tcaaacagtg caacggtacg 3240 
ggaagtggat acgagtgctg aagctgggaa tgatcaaggc atatttagag cagatttgtc 3300 
aggaaatgtt ttttcaagca gtacaacagt ggaagtgggt gcagtggatg aagctgggtc 3360 
tataaaagac aggtttgaga cggattcgtc aggaaatgtt tcaacaagtg cgacgatgtg 3420 
ggatgcaatt gatgaaaccg tggctgatca agacgcagtt gaggcggatt tgtcgggaaa 34 80 
tgcttcaagc tgcgcgacat acagagaagt ggatgatgtg gtggatgaaa ctagatcaga 354 0 
agaggaaaca tttgcgatgg atttgtttgc aagtgaatca ggccatgaga aacatatggc 3600 
agtggatcat gtgggtgaag ctaccgatga agaagagact taccaacagc aatatccagt 3660 
accgtcttca ttctctatgt gggacaaggc tattgctaaa acaggtgtaa gtttgaatcc 3720 
tgagctgcga cttgtcaggg ttgaagaaca aggcaaagta aattttagtg ataaaaaaga 3780 
cctgtcaatt gatgatttac caggacaaaa ccaatcgatc attggttcct ataaacaaga 3840 
taaatcaatt gctgatgttg cgggaccgac ccaatcaatt tttggttcta gtaaacaaca 3900 
ccggtcaatt gttgctttcc ccaaacaaaa ccagtcaatt gttagtgtca ctgagcaaaa 3 9 60 
gcagtccata gttggattcc gtagtcaaga tctttcggct gttagtctcc ctaaacaaaa 4020 
cgtaccaatt gttggtacgt cgagagaggg tcaaacaaag caagttcctg ttgttgatag 4080 
acaggatgcg ttgtatgtga atggactgga agctaaggag ggagatcaca catccgagaa 4140 
aaccgatgag gatgtgcttc atgtaaaatt taatgttgac aatgtgttgc ggaagcatca 4200 
ggcagataga acccaagcag tggaaacgat aacttggaag aaagttgatg aggaacatct 4260 
ttacatgact gaacatcaga taggtgctgc cgaaggacag atggtagtta acgaggatga 4 320 
gctttctata actgaaattg gaatggggag aggtgataaa attcagcatg tgctttctga 4380 
ggaagagctt tcatggtctg aagatgaagt gcagttaatt gaggatgatg gacaatatga 44 40 
agttgatgag acctctgtgt ccgttaacgt tgaacaagat atccaggggt caccacagga 4 500 
tgttgtggat ccgcaagcac taaaggtgat gctgcaagaa ctcgctgaga aaaattattc 4560 
gatgaggaac aagctgtttg tttttccaga ggtagtgaaa gctgattcag ttattgatct 4 620 
ttatttcaat cgtgacctaa cagctttggc gaatgaaccc gatgttgtca tcaaaggagc 4 680 
attcaatggt tggaaatgga ggcttttcac tgaaagattg cataagagtg accttggagg 4740 
ggtttggtgg tcttgcaaac tgtacatacc caaggaggcc .tacagattag actttgtgtt 4 800 
cttcaacggt cgcacggtct atgagaacaa tggcaacaat gatttctgta taggaataga 4860 
aggcactatg aatgaagatc tgtttgagga tttcttggtt aaagaaaagc aaagggagct 4 920 
tgagaaactt gccatggaag aagctgaaag gaggacacag actgaagaac agcggcgaag 4 980 
taaggaagca agggctgcag atgaagctgt cagggcacaa gcgaaggccg agatagagat 5040 
caagaacaaa aaattgcaga gtatgttgag tttggccaga acatgtgttg ataatttgtg 5100 
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gtacatagag gctagcacag atacaagcgg agatactatc aggttatact ataacagaaa 5160 
ctcgaggcca cttgcgcata gtactgagat ttggatgcat ggtggttaca acaattggtc 5220 
agatggactc tctattgttg aaagctttgt caagtgcaat gacagagacg gcgattggtg 5280 
gtatgcagat ggtacgacac ctcaaccttt gtacataagg caacattgtt ttgatttttt 5340 
ttgttgagga aacatttgtt ttgattctag cataatgctc ctacaaatat ggcatgaatt 5400 
tccttgtttt attgatgtca tgagaaagta ttttattaac tcgaaggcca tggaagctca 54 60 
acatttacca tagacagacg cttaaagatc atttgtattc cgtggatcat atatgtaatg 5520 
taatacctgt cttttctcta tatgtacagt tattccacct gaaaaagcac ttgtgttgga 5580 
ctgggttttt gctgatgggc cagctgggaa tgcaaggaac tatgacaaca atgctcgaca 5640 
agatttccat gctattcttc caaacaacaa tgtaaccgag gaaggcttct gggtgcaaga 5700 
ggagcaaaac atctatacaa ggcttctgca agaaaggaga gaaaaggaag aaaccatgaa 57 60 
aagaaaggtg agttgcaaca aaatctttgc atatgatctc tataattttg gcagttaacc 5820 
cctgagtgat ggcaaatata ttccctttcg tctattttcc aaattcaaaa tgcatggttc 5880 
catgcaagct tatccaaaat cacttgataa tataccaatc acaacataac tttgtttacc 594 0 
ataagaacat tcctacttaa aatttgcaag gtaactccct ttcgaggctg gttggcttga 6000 
tgagtaactg gcaattaaca aagaaaagat atatctgatg tttggaacaa aacatatgat 6060 
cagggttgtt tgggttgact catgttcctt tttacctacd caggctgaga gaagtgcaaa 6120 
tatcaaagct gagatgaagg caaaaactat gcgaaggttt ctgctttccc agaaacacat 6180 
tgtttatacc gaaccgcttg aaatacgtgc cggaaccaca gtggatgtgc tatacaatcc 6240 
ctctaacaca gtgctaaatg gaaagccgga ggtttggttt agatgctctt ttaacctttg 6300 
gatgcatcca agtggagcat tgccacccca gaagatggtg aaatcagggg atgggccgct 63 60 
cttaaaagcc acaggtttat tgcgttatta catcactgtt attagtatat atataaccat 6420 
ttttatgcaa tcaatagagt caagtgcaac taatgatgca cagataggat ccaatatttc 6480 
ttgttctatt attggtaata attagctagt ttaatgccat aagc:.:cataa cagatatgca 65^0 
actactccct ccaatccata ttacttgtcg caactttggt acaactttag tacaaagtta 6600 
tactaaagct gtgacaagta atatggaccg gagggagtac tatataagct tgtagctgtt 6660 
ttgagaccga gtgtctgctc gggtggctag ctggagcggg ctgaagtgct tgcaggcacc 6720 
tcttctctaa aaaaaagtgc ttgcagcccc cccgccccct ccatagggtg agtggtcacc 6780 
tttcttctta aaaattatgg caccaaggga aattctcggc tggtcgagct tgtagctatt 6840 
ttttcggagc gtgaatggga gcgtctttct gtataaggcc tataggctta ctttgatata 6900 
tattgtgaag tcacttaagc cttgttaaaa cgtagaaact tagttccgca acttggccaa 6960 
atccctgtta aattggttta ctgtgtacta gatgcatcga tggcgcagag tccggggggt 7020 
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aataaagctt ccattttcta caatgaagtt aattatccta cttgccttgt aattactgag 7080 
tacaatacag agcaccgaaa agctgtatcc ttcctacttc cttatgttta tctgtgttcc 7140 
ttgtctagtt aatgttccac cggatgccta tatgatggac tttgttttct ccgagtggga 7200 
agaagatggg atctatgaca acaggaatgg gatggactat catattcctg tttctgattc 7260 
aattgaaaca gagaattaca tgcgtattat ccacattgcc gttgagatgg cccccgttgc 7320 
aaaggtaata taattctaag gctagtttct ttgatgcgag gcgagatctc atcaccttat 7380 
gccttttttt cattctatgc cataatacta .tgctctgtca tgatcgatga tctcataggt 7440 
tggaggtctc ggggatgttg ttacaagtct ttcacgtgcc gttcaagatc tagggcatac 7500 
tgtcgaggtt attctcccga agtacgactg tttgaaccaa agcagtgtaa gttgaagtac 7560 
tgtactacat aatctattca cttagtcttt aaaatttcaa ctcaaaatgc cacgaagctt 7620 
caactgaagc taaagaattc tgagctgcga tggagcgcag tagggtggca cagatcccaa 7680 
taaaccaata catgaccaat aagggggtgc caagatcagt aggcactaat gaatttcctt 7740 
tgttttatat ccattataca ttattaatca agttacatct atttcaatgc aggtcaagga 7800 
tttacattta tatcaaagtt tttcttgggg tggtacagaa ataaaagtat gggttggacg 7860 
agtcgaagac ctgaccgttt acttcctgga acctcaaaat gggtatgaat cagctaatgt 7 920 
atagtttttt ttgtgggaaa tgtatagttg agtgatataa aacatattac ttcttttcac 7980 
aaaattatta ggctagagcc ttgtactggt taataatgtg tacctttttc tcattcatat 8040 
aactacttat cgtagactat agaagccaat tagtaacaca atacattggc cttggcattc 8100 
caggctgaga gctagttata acaatgatat gtgagattag tggctctata accacttttg 8160 
agctaaagga atttgctgct agatgagcca atcaatccaa ctaattttaa attccatgat 8220 
caccctagga cacgcagcct gcacaaccaa gaacacagct aagatcatcg cgtgggcaca 8280 
aaaggttgtg cattaaggct aggccctggt cagtggctgt caaggactcc atggggctcc 8340 
ttacagtttt tattctgata tctcttgcgc ccatatgacg ctaccaaacg cttgtaacct 8400 
gtagcaaact attgccatct gtcactcaat gataaggtag acaatctttc ctttcccttt 8460 
aagatgttca acctttattt atgcttgagg atgcgtttga ttgtcaaatt tcagtttctc 8520 
tagattgcag acacacttgc acgtgctgtg tacaccttcc attatctggc atgggatttg 8580 
catttcaatt aagagaaata tgaaagaaag aaatgttatc acctgaatgt tagagcttaa 8640 
aaggcacaag caatcagcac catttatcaa aaataaatga tttacttgtc tagttgtctc 8700 
tttttggttc tcttcctgta agtggatgcc aatatctcaa gaactctcct gaggattttt 8760 
cttcacaacc tattcatttt gacatttcct tttctaggat gtttggcgtc ggatgtgtat 8820 
atggaaggaa tgatgaccgc agatttgggt tcttctgtca ttctgctctt gagtttatcc 8880 
tccagaatga attttctcca gtacgtatta tttagaatac tagctgctat attgactttt 8940 
tctttgtgag actacacttt cttgtttacc attccagtgc accatgttca aaatcttgta 9000 
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ttcagcgcgt tactttcagt ttctttacta 
tcctactcta ctatctgaat gctacttgtg 
ccatttctca gttaaaaaaa cttgcatctg 
atgattggtc aagtgctccg gtcgcctggc 
tggcaagcac tcgggttgta tttaccatcc 
gtaaagcaat gacatactgt gataaagcca 
tttaatcttt ctgtttggcg cacagaaaat 
gttttgtacg tattcaacta tttccaccca 
ggacgtggca ggccatggtg ccattgctcc 
tggaattgat ccagatatct gggatccata 
tttcccagag tgcaagtaga tatataccaa 
gtttcatact acttcaggtc ccttatactt 
caaaaagggc cttgcagcag aagtttggat 
tcatcacccg tctgacagcc cagaagggaa 
ccctcgaaag caacggacag gttcatcatc 
gtttataaaa agttgcttac tatttgtttt 
aagtgtttgg ttcctaggtg gttttgcttg 
atttttgcag attggccgat gctcttcacg 
taacctacga tgagcctctt tctcacctgg 
cagcccttca ttatgggagc tggagactac 
tgcagatata cgctggctcc gacttcatta 
taacacaact tgttgccatg cgttatggat 
tgtgactatt tctctccatt atgctgcact 
aatggaaact atcctttggt atcgcaggac 
ataaggaccg ggctcggtct cttggtcttg 
acagcaacgg cgtggattat gccctcaaca 
attcagcagt agtgctaggt tatttacctt 
ggcgcttggt tcgatgcccg tgattggttc 
gactggtcat ggaaccggcc cgcactggac 
ttctgacacc caactgaacc aatggcaaga 
tacagggctg tgcagatcgt cttgcttcag 
gctgaggatc ctcttgcctc ctccaccagg 
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ctagcttatt 


tggtgcattg 


gtgtttcctt 


9060 


ttttcgcaac 


agttgcttct 


ttatcccctt 


9120 


tattcacgtg 


acagcatata 


atacat tgcc 


9180 


tatataagga 


acactattcc 


caatccagaa 


9240 


acaatcttga 


atttggagca 


cattatattg 


9300 


caactgtgag 


tgccttactg 


tcttgtaatt 


9360 


cttccacatt 


ttacagaatc 


atgttcttgt 


9420 


aacttttcag 


gtttctccta 


catattcaag 


9480 


tcatcgtgag 


aaattctacg 


gcattctcaa 


9540 


cactgacaat 


tttatcccgg 


taccagattt 


9600 


ggccacagat 


agttttatgc 


ttaactatgt 


9660 


gtgagaatgt 


tgtcgaaggc 


aagagagctg 


9720 


tacagcaaac 


tgatgtccct 


attgtcggaa 


9780 


tccacctcat 


caagcacgca 


attcaccgaa 


9840 


ccttgtgaac 


gaataaacat 


caaacgtttt 


9900 


tgtttacttc 


aaaacaaaag 


tctgaaaatg 


9960 


gttcagctcc 


agatcatcga 


atacaaggcg 


10020 


gtgtttacca 


cggtagggtg 


aagcttgttc 


10080 


tgagctccaa 


tatcctacac 


accatctagc 


10140 


tttataattt 


aggttgatga 


tcgatcatgc 


10200 


ttgtcccttc 


aatcttcgaa 


ccctgtggct 


10260 


cgatccctat 


agttcggaaa 


accggaggtg 


10320 


gatttgcata 


tgtcgagctg 


ttggacatga 


10380 


tttacgacac 


tgtcttcgac 


gtagacaatg 


10440 


aaccaaatgg 


gttcagtttc 


gacggagccg 


10500 


gagcaagtat 


cgttcctcaa 


ttagccctga 


10560 


gcatgttcca 


tacctcattt 


cagagcaatc 


10620 


cactccctgt 


gtaagagggt 


catggaacaa 


10680 


tacattgaat 


tgtaccatgc 


cgctcgaaaa 


10740 


acaagcgcat 


tgtgggatcg 


actacagtca 


10800 


ttagttccaa 


gcgcactgca 


gtcgtacata 


10860 


gggaacaaag 


cagaaatgca 


tgagtgcatt 


10920 
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gggaagactt ttatgtatat tgttaagatt ttccttttct tttccttccc tgcacctgga 10980 
aatggttaag cgcatcggca atataagaac cgcagtgaca ttttgtgagt agctttgtat 11040 
attctctcat cttgtgcaaa cttatgtgca tgctaggctc tctgatcatg tggaagcttt 11100 
gttatatgtt acttatggta tacatcaatg atatttacat ttgtggatga gctactgcac 11160 
ttggtttctg ctatctgttt tgtgaaatgg cagggccatg attatgcaga ttcactggtt 11220 
ctgaaacaga cacgctcctc taagctgtga ctgtgagctc tgaaaacagc attgttaaca- 11280 
tctattagta taaactaagg tacatcaacg gtgaagattt acgagctaaa ctccgtttgg 11340 
ttgtagacat tcactagaag tataagcgcg cttttctgcg ccgcctaggc tgcaatgatt 11400 
ttttttttat gtgtgtgtgg atatttcact atgacctgtg ggcaaaaggc tggccgagat 114 60 
ttaggaagcg ctcaagcaat tggccaatgg gaaggtgccg gccctgatgg fttcacggcc 11520 
cagttcttgc gctcctgctg ggatatcatc aagggagatc gagaattccc gggatccgcg 11580 



gccgcgagct tccctatagt gagtcgtatt a 

<210> 39 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE 
<400> 39 

hys Val Giy Giy Leu Giy Asp Val Val Thr Ser 
^ 5 10 

<210> 40 
<211> 11 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : PEPTIDE 
<400> 40 

Giy His Thr Val Glu Val He Leu Pro Lys Tyr 
^ 5 . 10 

<210> 41 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE 
<400> 41 

His Asp Trp ser Ser Ala Pro Val Ala Trp Leu Tyr Lys Glu His Tyr 

5 10 15 

<210> 42 



11611 
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<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE 
<400> 42 

Gly He Leu Asn Gly He Asp Pro Asp He Trp Asp Pro Tyr Thr Asp 
1 5 10 15 

<210> 43 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE 
<400> 43 

Asp Val Pro He Val Gly He He Thr Arg Leu Thr Ala Gin Lys Gly 
15 10 15 

<210> 44 
<2H> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE 
<400> 44 

Asn Gly Gin Val Val Leu Leu Gly Ser Ala 
15 10 

<210> 45 
<211> 27 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE 
<400> 45 

Ala Gly Ser Asp Phe He He Val Pro Ser He Phe Glu Pro Cys Gly 
15 10 15 

Leu Thr Gin Leu Val Ala Met Arg Tyr Gly Ser 

20 25 



<210> 46 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE 
<400> 46 

Thr Gly Gly Leu Val Asp Thr Val 
1 5 
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<210> 41 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE 
<400> 47 

Lys Thr Gly Gly Leu Gly Asp Val Ala Gly Ala 
^ 5 10 



<210> 48 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE 



<400> 48 

Gly His Arg Val Met Val Val Val Pro Arg Tyr 
^ 10 



<210> 49 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE 
<400> 49 

Asn Asp Trp His Thr Ala Leu Leu Pro Val Tyr Leu Lys Ala Tyr Tyr 

<210> 50 
<2H> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

•-223> Description of Artificial Sequence : PEPTIDE 
<400> 50 

Gly He Val Asn Gly He Asp Asn Met Glu Trp Asn Pro Glu Val Asp 



10 



<210> 51 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE 
<400> 51 

Asp Val Pro Leu Leu Gly Phe 11 



15 



e Gly Arg Leu Asp Gly Gin Lys Glv 
10 15 
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<210> 52 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE 
<400> 52 

Asp Val Gin Leu Val Met Leu Gly Thr Gly 
1 5 10 

<210> 53 
<211> 27 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE 
<400> 53 

Ala Gly Ala Asp Ala Leu Leu Met Pro Ser Arg Phe Xaa Pro Cys Gly 
15 10 15 

Leu Asn Gin Leu Tyr Ala Met Ala Tyr Gly Thr 

20 25 

<210> 54 
<211> 8 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE 
<400> 54 

Val Gly Gly Xaa Arg Asp Thr Val 

1 5 . . 
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